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Preface to the Revised Edition 


Lie Algebras in Particle Physics has been a very successful book. I have 
long resisted the temptation to produce a revised edition. I do so finally, 
because I find that there is so much new material that should be included, 
and so many things that I would like to say slightly differently. On the other 
hand, one of the good things about the first edition was that it did not do too 
much. The material could be dealt with in a one semester course by students 
with good preparation in quantum mechanics. In an attempt to preserve this 
advantage while including new material, I have flagged some sections that 
can be left out in a first reading. The titles of these sections begin with an 
asterisk, as do the problems that refer to them. 

I may be prejudiced, but I think that this material is wonderful fun to 
teach, and to learn. I use this as a text for what is formally a graduate class, 
but it is taken successfully by many advanced undergrads at Harvard. The 
important prerequisite is a good background in quantum mechanics and linear 
algebra. 

It has been over five years since I first began to revise this material and 
typeset it in IATEX. Between then and now, many many students have used the 
evolving manuscript as a text. I am grateful to many of them for suggestions 
of many kinds, from typos to grammar to pedagogy. 

As always, I am enormously grateful to my family for putting up with 
me for all this time. I am also grateful for their help with my inspirational 


epilogue. 
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Why Group Theory? 


Group theory is the study of symmetry. It is an incredible labor saving device. 
It allows us to say interesting, sometimes very detailed things about physical 
systems even when we don't understand exactly what the systems are! When 
I was a teenager, I read an essay by Sir Arthur Stanley Eddington on the 
Theory of Groups and a quotation from it has stuck with me for over 30 
years:! 


We need a super-mathematics in which the operations are as un- 
known as the quantities they operate on, and a super-mathematician 
who does not know what he is doing when he performs these op- 
erations. Such a super-mathematics is the Theory of Groups. 


In this book, I will try to convince you that Eddington had things a little 
bit wrong, as least as far as physics is concerned. A lot of what physicists 
use to extract information from symmetry is not the groups themselves, but 
group representations. You will see exactly what this means in more detail as 
you read on. What I hope you will take away from this book is enough about 
the theory of groups and Lie algebras and their representations to use group 
representations as labor-saving tools, particularly in the study of quantum 
mechanics. 

The basic approach will be to alternate between mathematics and physics, 
and to approach each problem from several different angles. I hope that you 
will learn that by using several techniques at once, you can work problems 
more efficiently, and also understand each of the techniques more deeply. 


lin The World of Mathematics, Ed. by James R. Newman, Simon & Schuster, New York, 
1956. 


1 DOI: 10.1201/9780429499210-1 


Chapter 1 


Finite Groups 


We will begin with an introduction to finite group theory. This is not intended 
to be a self-contained treatment of this enormous and beautiful subject. We 
will concentrate on a few simple facts that are useful in understanding the 
compact Lie algebras. We will introduce a lot of definitions, sometimes prov- 
ing things, but often relying on the reader to prove them. 


1.4 Groups and representations 


A Group, G, is a set with a rule for assigning to every (ordered) pair of 
elements, a third element, satisfying: 


(1.4.1) If f,g € G then h = fg E G. 
(1.4.2) For f, g, h € G, f (gh) = (fg)h. 
(1.A.3) There is an identity element, e, such that for all f € G, ef = 
fe- f. 
(1.A.4) Every element f € G has an inverse, fT}, such that f f~! = 
ff =e. 
Thus a group is a multiplication table specifying g1g2 Vg1,g2 € G. If 
the group elements are discrete, we can write the multiplication table in the 


form 
\ ie} g | go | ++ 


eee (1.1) 
92 || 92 | 9291 | 9292 | --- 
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1.2. EXAMPLE - Z3 3 


A Representation of G is a mapping, D of the elements of G onto a set of 
linear operators with the following properties: 


1.B.1 D(e) = 1, where 1 is the identity operator in the space on which 
the linear operators act. 


1.B.2 D(g1) D(go) = D(g1go), in other words the group multiplica- 
tion law is mapped onto the natural multiplication in the linear 
space on which the linear operators act. 


12 Example - 23 


A group is finite if it has a finite number of elements. Otherwise it is infinite. 
The number of elements in a finite group G is called the order of G. Here is 
a finite group of order 3. 


(1.2) 


This is Z3, the cyclic group of order 3. Notice that every row and column 
of the multiplication table contains each element of the group exactly once. 
This must be the case because the inverse exists. 

An Abelian group in one in which the multiplication law is commutative 


9192 = 9291 - (1.3) 


Evidently, Z3 is Abelian. 
The following is a representation of Z3 


D(e)=1, D(a)=e'™/3, D(b) = etri (1.4) 


The dimension of a representation is the dimension of the space on which 
it acts — the representation (1.4) is 1 dimensional. 
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1.3 The regular representation 


Here's another representation of Z3 


100 0 0 1 
»o- (1 1 jJ. »o-( 0 ] 
0 0 1 010 


This representation was constructed directly from the multiplication ta- 
ble by the following trick. Take the group elements themselves to form an 
orthonormal basis for a vector space, |e), |a), and |b). Now define 


D(g1)lg2) = |9192) (1.6) 


The reader should show that this is a representation. It is called the regular 
representation. Evidently, the dimension of the regular representation is the 
order of the group. The matrices of (1.5) are then constructed as follows. 


le) =e), |e2) 5]la), les) = |b) (1.7) 


[D(g)]i = (el D(g)les) (1.8) 
The matrices are the matrix elements of the linear operators. (1.8) is a 
simple, but very general and very important way of going back and forth from 
operators to matrices. This works for any representation, not just the regular 
representation. We will use it constantly. The basic idea here is just the 
insertion of a complete set of intermediate states. The matrix corresponding 
to a product of operators is the matrix product of the matrices corresponding 
to the operators — 


(1.5) 


(D(g9192)] = [D(91) D(g2) liz 
= (e;|D(91)D(g2)le;) 
= 3 (eiD(gilex) (exID(ga)le;) (1.9) 
k 


= M [D(g]a(D(g2)]kj 
k 


Note that the construction of the regular representation is completely gen- 
eral for any finite group. For any finite group, we can define a vector space in 
which the basis vectors are labeled by the group elements. Then (1.6) defines 
the regular representation. We will see the regular representation of various 
groups in this chapter. 


1.4. IRREDUCIBLE REPRESENTATIONS 5 


1.4 Irreducible representations 


What makes the idea of group representations so powerful is the fact that they 
live in linear spaces. And the wonderful thing about linear spaces is we are 
free to choose to represent the states in a more convenient way by making 
a linear transformation. As long as the transformation is invertible, the new 
states are just as good as the old. Such a transformation on the states produces 
a similarity transformation on the linear operators, so that we can always 
make a new representation of the form 


D(g) > D'(g) = S"! D(g)S (1.10) 


Because of the form of the similarity transformation, the new set of operators 
has the same multiplication rules as the old one, so D’ is a representation if 
D is. D' and D are said to be equivalent representations because they differ 
just by a trivial choice of basis. 

Unitary operators (O such that Ot = O^) are particularly important. A 
representation is unitary if all the D(g)s are unitary. Both the representations 
we have discussed so far are unitary. It will turn out that all representations of 
finite groups are equivalent to unitary representations (we'll prove this later - 
it is easy and neat). 

A representation is reducible if it has an invariant subspace, which 
means that the action of any D(g) on any vector in the subspace is still in 
the subspace. In terms of a projection operator P onto the subspace this con- 
dition can be written as 


PD(g)P = D(g)P Vg EG (1.11) 


For example, the regular representation of Z3 (1.5) has an invariant sub- 


space projected on by 
,fi 11 
P= 3 l.l 1 (1.12) 


1 1 1 


because D(g)P = P Vg. The restriction of the representation to the invariant 
subspace is itself a representation. In this case, it is the trivial representa- 
tion for which D(g) — 1 (the trivial representation, D(g) — 1, is always a 
representation — every group has one). 

A representation is irreducible if it is not reducible. 

A representation is completely reducible if it is equivalent to a represen- 
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tation whose matrix elements have the following form: 


Dig 0 + 
0 Deg) +> (1.13) 


where D;(g) is irreducible Vj. This is called block diagonal form. 
A representation in block diagonal form is said to be the direct sum of 
the subrepresentations, D;(g), 


DiS D- (1.14) 


In transforming a representation to block diagonal form, we are decom- 
posing the original representation into a direct sum of its irreducible com- 
ponents. Thus another way of defining complete reducibility is to say that 
a completely reducible representation can be decomposed into a direct 
sum of irreducible representations. This is an important idea. We will use 
it often. 

We will show later that any representation of a finite group is completely 
reducible. For example, for (1.5), take 


1 1 1 1 
S= 3 l v w (1.15) 
l w w? 
where i 
w = erifà (1.16) 
then 
1 0 0 1 0 0 
Diiey= (0-1 0 D'(a-2|0 w 0 
0 0 w 


1.5 Transformation groups 


There is a natural multiplication law for transformations of a physical system. 
If gı and g> are two transformations, g;go means first do g» and then do gi. 


1.6. APPLICATION: PARITY IN QUANTUM MECHANICS 7 


Note that it is purely convention whether we define our composition law to 
be right to left, as we have done, or left to right. Either gives a perfectly 
consistent definition of a transformation group. 

If this transformation is a symmetry of a quantum mechanical system, 
then the transformation takes the Hilbert space into an equivalent one. Then 
for each group element g, there is a unitary operator D(g) that maps the 
Hilbert space into an equivalent one. These unitary operators form a repre- 
sentation of the transformation group because the transformed quantum states 
represent the transformed physical system. Thus for any set of symmetries, 
there is a representation of the symmetry group on the Hilbert space — we 
say that the Hilbert space transforms according to some representation of the 
group. Furthermore, because the transformed states have the same energy as 
the originals, D(g) commutes with the Hamiltonian, [D(g), H] — 0. As we 
will see in more detail later, this means that we can always choose the energy 
eigenstates to transform like irreducible representations of the group. It is 
useful to think about this in a simple example. 


1.6 Application: parity in quantum mechanics 


Parity is the operation of reflection in a mirror. Reflecting twice gets you 
back to where you started. If p is a group element representing the parity 
reflection, this means that p? = e, Thus this is a transformation that together 
with the identity transformation (that is, doing nothing) forms a very simple 
group, with the following multiplication law: 


Vel» 


Um 
pjipie| 


This group is called Z5. For this group there are only two irreducible rep- 
resentations, the trivial one in which D(p) = 1 and one in which D(e) = 1 
and D(p) — —1. Any representation is completely reducible. In particular, 
that means that the Hilbert space of any parity invariant system can be de- 
composed into states that behave like irreducible representations, that is on 
which D(p) is either 1 or —1. Furthermore, because D(p) commutes with 
the Hamiltonian, D(p) and H can be simultaneously diagonalized. That is 
we can assign each energy eigenstate a definite value of D(p). The energy 
eigenstates on which D(p) — 1 are said to transform according to the trivial 
representation. Those on which D(p) — —1 transform according to the other 
representation. This should be familiar from nonrelativistic quantum me- 
chanics in one dimension. There you know that a particle in a potential that is 
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symmetric about z = 0 has energy eigenfunctions that are either symmetric 
under z — — (corresponding to the trivial representation), or antisymmetric 
(the representation with D(p) = —1). 


17 Example: 55 


The permutation group (or symmetric group) on 3 objects, called S3 where 


a, = (1,2,3) a2 = (3, 2,1) 
Q3 = (1, 2) a4 = (2, 3) a5 = (3, 1) 


(1.19) 


The notation means that a; is acyclic permutation of the things in positions 1, 
2 and 3; a» is the inverse, anticyclic permutation; a3 interchanges the objects 
in positions 1 and 2; and so on. The multiplication law is then determined by 
the transformation rule that gı go means first do gz and then do gj. It is 


XH e | a1 | a2 | as a4 as | 
ar far |ar| e [as las [as | 


(1.20) 


Tan | 
as | as | as |as |a | a2 | € | 


We could equally well define it to mean first do gı and then do g2. These 
two rules define different multiplication tables, but they are related to one 
another by simple relabeling of the elements, so they give the same group. 
There is another possibility of confusion here between whether we are per- 
muting the objects in positions 1, 2 and 3, or simply treating 1, 2 and 3 as 
names for the three objects. Again these two give different multiplication ta- 
bles, but only up to trivial renamings. The first is a little more physical, so we 
will use that. The permutation group is an another example of a transforma- 
tion group on a physical system. 

$3 is non-Abelian because the group multiplication law is not commuta- 
tive. We will see that it is the lack of commutativity that makes group theory 
so interesting. 


1.8. EXAMPLE: ADDITION OF INTEGERS 9 


Here is a unitary irreducible representation of 55 


-1 0 
( " 1) (1.21) 


The interesting thing is that the irreducible unitary representation is more 
than 1 dimensional. It is necessary that at least some of the representations 
of a non-Abelian group must be matrices rather than numbers. Only matri- 
ces can reproduce the non-Abelian multiplication law. Not all the operators 
in the representation can be diagonalized simultaneously. It is this that is 
responsible for a lot of the power of the theory of group representations. 


18 Example: addition of integers 
The integers form an infinite group under addition. 
ry-r-y (1.22) 


This is rather unimaginatively called the additive group of the integers. Since 
this group is infinite, we can't write down the multiplication table, but the 
rule above specifies it completely. 

Here is a representation: 


D(z) = Lo D) (1.23) 


This representation is reducible, but you can show that it is not completely 
reducible and it is not equivalent to a unitary representation. It is reducible 
because 


D(z)P = P (1.24) 
where ue 
P= t o) (1.25) 
However, 
D(z) — P) # (I — P) (1.26) 


so it is not completely reducible. 
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The additive group of the integers is infinite, because, obviously, there are 
an infinite number of integers. For a finite group, all reducible representations 
are completely reducible, because all representations are equivalent to unitary 
representations. 


1.9 Useful theorems 


Theorem 1.1 Every representation of a finite group is equivalent to a unitary 
representation. 


Proof: Suppose D(g) is a representation of a finite group G. Construct the 
operator 
S = Y D(g)! D(g) (1.27) 
geG 
S is hermitian and positive semidefinite. Thus it can be diagonalized and its 
eigenvalues are non-negative: 


S -U-!qU (1.28) 
where d is diagonal 
d 0 -> 
d=| 0 d --- (1.29) 


where dj > 0 Vj. Because of the group property, all of the d;s are actually 
positive. Proof — suppose one of the djs is zero. Then there is a vector A 
such that SA = 0. But then 


ASA =0= Y, IIDA. (1.30) 
gEG 


Thus D(g)À must vanish for all g, which is impossible, since D(e) = 1. 
Therefore, we can construct a square-root of S that is hermitian and invertible 


vd, 0 
x-g?zzpy3| 0 Vd -|y (1.31) 


X is invertible, because none of the d;s are zero. We can now define 


D'(g) = X D(g) X~* (1.32) 
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Now, somewhat amazingly, this representation is unitary! 
D'(g)! D'(g) = X"! D(g)! SD(g)X ^ (1.33) 
but 


D(g)! SD(g) = D(g)! (E zr) D(g) 


heG 
= V D(hg)! D(hg) (1.34) 
heG 
= M D(h)'D(h) = S = X? 


heG 


where the last line follows because hg runs over all elements of G when A 
does. QED. 

We saw in the representation (1.23) of the additive group of the integers 
an example of a reducible but not completely reducible representation. The 
way it works is that there is a P that projects onto an invariant subspace, but 
(1 — P) does not. This is impossible for a unitary representation, and thus 
representations of finite groups are always completely reducible. Let's prove 
it. 


Theorem 1.2 Every representation of a finite group is completely reducible. 


Proof: By the previous theorem, it is sufficient to consider unitary repre- 
sentations. If the representation is irreducible, we are finished because it is 
already in block diagonal form. If it is reducible, then 3 a projector P such 
that PD(g)P = D(g)P Vg € G. This is the condition that P be an invariant 
subspace. Taking the adjoint gives PD(g)! P — PD(g)! Vg € G. But be- 
cause D(g) is unitary, D(g)! = D(g)~! = D(g7!) and thus since g^! runs 
over all G when g does, PD(g)P = PD(g) Vg € G. But this implies that 
(1— P)D(g)(1 — P) = D(g)(1 — P) Vg € G and thus 1 — P projects onto 
an invariant subspace. Thus we can keep going by induction and eventually 
completely reduce the representation. 


1.10 Subgroups 


A group H whose elements are all elements of a group G is called a subgroup 
of G. The identity, and the group G are trivial subgroups of G. But many 
groups have nontrivial subgroups (which just means some subgroup other 
than G or e) as well. For example, the permutation group, 53, has a Za 
subgroup formed by the elements {e, a1, a2}. 
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We can use a subgroup to divide up the elements of the group into subsets 
called cosets. A right-coset of the subgroup H in the group G is a set of 
elements formed by the action of the elements of H on the left on a given 
element of G, that is all elements of the form Hg for some fixed g. You can 
define left-cosets as well. 

For example, {a3,a4,a5} is a coset of Z3 in S3 in (1.20) above. The 
number of elements in each coset is the order of H. Every element of G 
must belong to one and only one coset. Thus for finite groups, the order of 
a subgroup H must be a factor of order of G. It is also sometimes useful to 
think about the coset-space, G/H defined by regarding each coset as a single 
element of the space. 

A subgroup H of G is called an invariant or normal subgroup if for 
every g E G 


gH = Hg (1.35) 


which is (we hope) an obvious short-hand for the following: for every g € G 
and hı € H there exists an ho € H such that hig = gho, or ghog™! = hy. 
The trivial subgroups e and G are invariant for any group. It is less ob- 
vious but also true of the subgroup Z3 of S3 in (1.20) (you can see this 
by direct computation or notice that the elements of Z3 are those permuta- 
tions that involve an even number of interchanges). However, the set {e, a4} 
is a subgroup of G which is not invariant. as(e,a4) = {a5,a,} while 
(e, a4 }a5 = (as, az}. 

If H is invariant, then we can regard the coset space as a group. The 
multiplication law in G gives the natural multiplication law on the cosets, 
Hg: 

(Hgi)(Hgz) = (Hg Hgy )(192) (1.36) 


But if H is invariant Hg; Hg, ! — H, so the product of elements in two 
cosets is in the coset represented by the product of the elements. In this case, 
the coset space, G/H, is called the factor group of G by H. 

What is the factor group 53/23? The answer is Z2. 

The center of a group G is the set of all elements of G that commute 
with all elements of G. The center is always an Abelian, invariant subgroup 
of G. However, it may be trivial, consisting only of the identity, or of the 
whole group. 

There is one other concept, related to the idea of an invariant subgroup, 
that will be useful. Notice that the condition for a subgroup to be invariant 
can be rewritten as 


gHg | -HVgcG (1.37) 
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This suggests that we consider sets rather than subgroups satisfying same 
condition. 
g`’ Sg-SVgeG (1.38) 


Such sets are called conjugacy classes. We will see later that there is a one- 
to-one correspondence between them and irreducible representations. A sub- 
group that is a union of conjugacy classes is invariant. 
Example — 
The conjugacy classes of 55 are {e}, (a1, a2} and (a3, a4, a5}. 
The mapping 
G—gGg^! (1.39) 


for a fixed g is also interesting. It is called an inner automorphism. An 
isomorphism is a one-to-one mapping of one group onto another that pre- 
serves the multiplication law. An automorphism is a one-to-one mapping 
of a group onto itself that preserves the multiplication law. It is easy to see 
that (1.39) is an automorphism. Because g^1g19 g^ 1gog = 9^ 1g19ag. it pre- 
serves the multiplication law. Since g^ ^g1g = g 929 => g1 = go, it is one 
to one. Àn automorphism of the form (1.39) where g is a group element is 
called an inner automorphism). An outer automorphism is one that cannot 
be written as g^ 1 Gg for any group element g. 


1.[11 Schur's lemma 


Theorem 1.3 /f Di(g) A = ADo(g) Vg € G where D and D» are inequiv- 
alent, irreducible representations, then A = 0. 


Proof: This is part of Schur’s lemma. First suppose that there is a vector |u) 
such that Ajj) = 0. Then there is a non-zero projector, P, onto the subspace 
that annihilates A on the right. But this subspace is invariant with respect to 
the representation D2, because 


AD2(9)P = Di(g)AP =0Vg EG (1.40) 


But because D» is irreducible, P must project onto the whole space, and 
A must vanish. If A annihilates one state, it must annihilate them all. A 
similar argument shows that A vanishes if there is a (v| which annihilates 
A. If no vector annihilates A on either side, then it must be an invertible 
square matrix. It must be square, because, for example, if the number of 
rows were larger than the number of columns, then the rows could not be a 
complete set of states, and there would be a vector that annihilates A on the 
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right. A square matrix is invertible unless its determinant vanishes. But if the 
determinant vanishes, then the set of homogeneous linear equations 


Alu) — 0 (1.41) 


has a nontrivial solution, which again means that there is a vector that anni- 
hilates A. But if A is square and invertible, then 


A Di(g)À = D»(g) Vg E G (1.42) 


so D, and D» are equivalent, contrary to assumption. QED. 

The more important half of Schur's lemma applies to the situation where 
Dı and Dz above are equivalent representations. In this case, we might as 
well take Dı = D» = D, because we can do so by a simple change of basis. 
The other half of Schur's lemma is the following. 


Theorem 1.4 If D(g) A = AD(g) Vg € G where D is a finite dimensional 
irreducible representation, then A x I. 


In words, if a matrix commutes with all the elements of a finite dimensional 
irreducible representation, it is proportional to the identity. 
Proof: Note that here the restriction to a finite dimensional representation 
is important. We use the fact that any finite dimensional matrix has at least 
one eigenvalue, because the characteristic equation det(A — AI) = 0 has at 
least one root, and then we can solve the homogeneous linear equations for 
the components of the eigenvector |). But then D(g)(A — AI) = (A — 
AI)D(g) Vg € G and (A — AI)|u) = 0. Thus the same argument we used in 
the proof of the previous theorem implies (A — AJ) = 0. QED. 

A consequence of Schur's lemma is that the form of the basis states of an 
irreducible representation are essentially unique. We can rewrite theorem 1.4 
as the statement 


A^"!D(g)JA- D(g) Vg G-— A«I (1.43) 


for any irreducible representation D. This means once the form of D is fixed, 
there is no further freedom to make nontrivial similarity transformations on 
the states. The only unitary transformation you can make is to multiply all 
the states by the same phase factor. 

In quantum mechanics, Schur's lemma has very strong consequences for 
the matrix elements of any operator, O, corresponding to an observable that 
is invariant under the symmetry transformations. This is because the matrix 
elements (a, j, z|O|b, k, y) behave like the A operator in (1.40). To see this, 
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let’s consider the complete reduction of the Hilbert space in more detail. The 
symmetry group gets mapped into a unitary representation 


g D(g) Vg EG (1.44) 


where D is the (in general very reducible) unitary representation of G that 
acts on the entire Hilbert space of the quantum mechanical system. But if the 
representation is completely reducible, we know that we can choose a basis 
in which D has block diagonal form with each block corresponding to some 
unitary irreducible representation of G. We can write the orthonormal basis 


states as 
la, j, x) (1.45) 


satisfying 
(a,j, x | b,k,y) = dab Ójk Oxy (1.46) 


where a labels the irreducible representation, 7 = 1 to n, labels the state 
within the representation, and z represents whatever other physical parame- 
ters there are. 

Implicit in this treatment is an important assumption that we will almost 
always make without talking about it. We assume that have chosen a basis in 
which all occurences of each irreducible representation a, is described by the 
same set of unitary representation matrices, D4(g). In other words, for each 
irreducible representation, we choose a canonical form, and use it exclusively 

In this special basis, the matrix elements of D(g) are 


(a, j, z| D(g) |b, k, y) = Sab Sey [Da(9)] js (1.47) 


This is just a rewriting of (1.13) with explicit indices rather than as a matrix. 
We can now check that our treatment makes sense by writing the representa- 
tion D in this basis by inserting a complete set of intermediate states on both 
sides: 

T= la j2)(2.5.2| (1.48) 


a,j, 


Then we can write 


D(g) = Y la, j, 2) (a, 3,2] D(g) Y; |b, k, y)(b, k, yl 


0,2, b,k,y 
= 2, la, j, x) Sab Oxy [Da(9)] ji (b, k, yl (1.49) 
b,k,y 


= Y. |a, $2) [Da(9)]; (a, k, zl 


a,j,k,x 
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This is another way of writing a representation that is in block diagonal form. 
Note that if a particular irreducible representation appears only once in D, 
then we don't actually need the x variable to label its states. But typically, 
in the full quantum mechanical Hilbert space, each irreducible representation 
will appear many times, and then the physical x variable distinguish states 
that have the same symmetry properties, but different physics. The important 
fact, however, is that the dependence on the physics in (1.47) is rather trivial 
— only that the states are orthonormal — all the group theory is independent 
of x and y. 
Under the symmetry transformation, since the states transform like 


lu)  D(g)) (al (4 Dg) (1.50) 
operators transform like 
O + D(g) O D(g)! (1.51) 


in order that all matrix element remain unchanged. Thus an invariant observ- 
able satisfies 


O ^ D(g) O D(g)' =O (1.52) 
which implies that O commutes with D(g) 
[O, D(g)] =0Vg EG. (1.53) 


Then we can constrain the matrix element 
(a, j, 2|O|b, k, y) (1.54) 
by arguing as follows: 
0 = (a,j, z|[O, D(g)]|b, k, y) 


a Sila, zol, k',y)(b,k', y|D(g)|b, k, y) (1.55) 


X a, j, | D(g)]a, j', x) (a; 9’, z|O|b, k, y) 

Now we use (1.47), which exhibits the fact that the matrix elements of D(g) 
have only trivial dependence on the physics, to write 
0= (a, j, z|[O, D(g)]|b, k, y) 

- 2 (0.3, 21016, K',y)[Do( 9) lark 


2» a(g)lj5( a ds z|O|b, k, y) 


(1.56) 
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Thus the matrix element (1.54) satisfies the hypotheses of Schur's lemma. It 
must vanish if a Æ b. It must be proportional to the identity (in indices, that 
is 6;,) for a = b. However, the symmetry doesn't tell us anything about the 
dependence on the physical parameters, x and y. Thus we can write 


(a, j, z|Olb, k, y) = fa(z,y) bab Ójk (1.57) 


The importance of this is that the physics is all contained in the function 
falz, y) — all the dependence on the group theory labels is completely fixed 
by the symmetry. As we will see, this can be very powerful. This is a simple 
example of the Wigner-Eckart theorem, which we will discuss in much more 
generality later. 


1.12 * Orthogonality relations 


The same kind of summation over the group elements that we used in the 
proof of theorem 1.1, can be used together with Schur's lemma to show some 
more remarkable properties of the irreducible representations. Consider the 
following linear operator (written as a "dyadic") 


A% = V. D«(g !)|a, j) (b | Do(g) (1.58) 
gEG 


where Da and D, are finite dimensional irreducible representations of G. 
Now look at 


«(91) 45? = X` D«(g1) Do (9 |a, j) (b, £|Di(9) (1.59) 
gcG 
= Y Da(gig™")|a, j} (b, 4| Ds(g) (1.60) 
gEG 
= M Da((g97')~*) la, 3) (b, £| D (9) (1.61) 
gEG 


Now let g' = gg]! 


= V Da(g'™™)]a, j} (b, |Dv(g gi) (1.62) 
g'EG 
= Y Da(g'*)|a, 3) (b, €|Do(g')Do(gr) = A$ Do (g1) (1.63) 


g'€G 
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Now Schur's lemma (theorems 1.3 and 1.4) implies Av = 0 if D, and 
D, are different, and further that if they are the same (remember that we have 
chosen a canonical form for each representation so equivalent representations 
are written in exactly the same way) A9 x I. Thus we can write 


AZ = Y D«(g )la; j) (5 €] Do(g) = Sab Ṣe I (1.64) 
gEG 


To compute A5,, compute the trace of AY y (in the Hilbert space, not the in- 
dices) in two different ways. We can aie 


Tr A2 2 = bap Tr (AX I) = ab Ad, Tr I = Sap Ato Nna (1.65) 


where n, is the dimension of D,. But we can also use the cyclic property of 
the trace and the fact that Ag. x dap to write 


Tr AY = = bab X (a, 4D«s(g) D«(g ^)la. 7) m N bap je (1.66) 
geG 
where N is the order of the group. Thus A$, = NÓj, [Na and we have shown 
EC N 
>, D«(9 la, j) (b, €| Dog) = — bab Sjel (1.67) 
gEG Na 


Taking the matrix elements of these relations yields orthogonality relations 
for the matrix elements of irreducible representations. 


n = 
D Fy Dolo Dues LDo(a)lem = 5a05jedem (1.68) 
gEG 
For unitary irreducible representations, we can write 
n 
ps "v Pa (9) s Do (9)]em = 60b0je0km (1.69) 
geG 
so that with proper normalization, the matrix elements of the inequivalent 
unitary irreducible representations 


"Ds GL (1.70) 


are orthonormal functions of the group elements, g. Because the matrix ele- 
ments are orthonormal, they must be linearly independent. We can also show 
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that they are a complete set of functions of g, in the sense that an arbitrary 
function of g can be expanded in them. An arbitrary function of g can be writ- 
ten in terms of a bra vector in the space on which the regular representation 
acts: 


F(g) = (F|g) = (F|Dn(g)le) (1.71) 
where 
=  F(g)gl (1.72) 
gcG 


and Dg is the regular representation. Thus an arbitrary F(g) can be written 
as a linear combination of the matrix elements of the regular representation. 


= Y, F(g)(gIDs(g)le) = Yo F(GDtDa() HS) 


g CG g EG 


But since Dg is completely reducible, this can be rewritten as a linear com- 
bination of the matrix elements of the irreducible representations. Note that 
while this shows that the matrix elements of the inequivalent irreducible rep- 
resentations are complete, it doesn’t tell us how to actually find what they are. 
The orthogonality relations are the same. They are useful only once we ac- 
tually know explicitly what the representation look like. Putting these results 
together, we have proved 


Theorem 1.5 The matrix elements of the unitary, irreducible representations 
of G are a complete orthonormal set for the vector space of the regular rep- 
resentation, or alternatively, for functions of g € G. 


An immediate corollary is a result that is rather amazing: 


N= on} (1.74) 


— the order of the group N is the sum of the squares of the dimensions of the 
irreducible representations n; just because this is the number of components 
of the matrix elements of the irreducible representations. You can check that 
this works for all the examples we have seen. 

Example: Fourier series — cyclic group Zw with elements a; for j = 
0 to N — 1 (with ag = e) 


ajQk = Q(j--k) mod N (1.75) 
The irreducible representations of Zy are 


D,,(aj;) = e?rinilN (1.76) 
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all 1-dimensional.! Thus (1.69) gives 


i 
+ Ye —2nin'j/N g2ninj/N _ = nie (1.77) 


which is the fundamental relation for Fourier series. 


1.13 Characters 


The characters yp(g) of a group representation D are the traces of the linear 
operators of the representation or their matrix elements: 


xp(g) = Tr D(g) = Y '(D(g)la (1.78) 


i 


The advantage of the characters is that because of the cyclic property of the 
trace Tr(AB) = Tr(B A), they are unchanged by similarity transformations, 
thus all equivalent representations have the same characters. The characters 
are also different for each inequivalent irreducible representation, Da — in 
fact, they are orthonormal up to an overall factor of N — to see this just sum 
(1.69) over j = k and £ = m 


1 1 
5 yP (9) lfx{Do(g)lem = Y; z 9abÓjtÓkm = Sab 
pon is 


or 


T: =E XD. (9)* XD, (9) = Sas (1.79) 
N e 
Since the characters of different irreducible representations are orthogonal, 
they are different. 
The characters are constant on conjugacy classes because 


Tr D(g!gig) = Tr(D(g !) D(g1) D(g)) = Tr D(g:) (1.80) 


It is less obvious, but also true that the characters are a complete basis for 
functions that are constant on the conjugacy classes and we can see this by 
explicit calculation. Suppose that F'(gi) is such a function. We already know 


'We will prove below that Abelian finite groups have only 1-dimensional irreducible rep- 
resentations. 
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that F'(g1) can be expanded in terms of the matrix elements of the irreducible 
representations — 


F(g1) = X c%[Dal(n)] je (1.81) 
a,j,k 


but since F is constant on conjugacy classes, we can write it as 


1 1 
Flg) = 7 2 Fl ng) = 7 2, Ds nol. (1.82) 


geG a,j,k 
and thus 
1 a —1 
Flg) = y Le clDs(g eDa(g]eniDa(s))ge — — 0.85 
a,j,k 
g,é,m 


But now we can do the sum over g explicitly using the orthogonality relation, 
(1.68). 
1 a 
F(n) =>, "uL (91)]tmÓjkÓtm (1.84) 


ajk à 


l,m 
or i : 
F(n)*,-dD«m)e-,—cjx«m) (185 
a,j, Na aj Na 
This was straightforward to get from the orthogonality relation, but it has an 
important consequence. The characters, x4 (g), of the independent irreducible 
representations form a complete, orthonormal basis set for the functions that 
are constant on conjugacy classes. Thus the number of irreducible representa- 
tions is equal to the number of conjugacy classes. We will use this frequently. 
This also implies that there is an orthogonality condition for a sum over 
representations. To see this, label the conjugacy classes by an integer a, and 
let kg be the number of elements in the conjugacy class. Then define the 
matrix V with matrix elements 


| ka 
Vaa = Ayr. XD (Ja) (1.86) 


where ga is the conjugacy class œ. Then the orthogonality relation (1.79) can 
be written as VIV = 1. But V is a square matrix, so it is unitary, and thus we 
also have VV! = 1, or 


N 
2 xp. (9) xp. (98) 1525 (1.87) 
a a 
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Consequences: Let D be any representation (not necessarily irreducible). In 
its completely reduced form, it will contain each of the irreducible represen- 
tations some integer number of times, Ma. We can compute m, simply by 
using the orthogonality relation for the characters (1.79) 


1 
x 22 Xp. (9) xp(g) = m (1.88) 
geG 
The point is that D is a direct sum 


mP times 
3 Da- 6D, (1.89) 
a 
For example, consider the regular representation. It’s characters are 
xr(e)=N  xn(g) =0 forg ze (1.90) 
Thus 
mË = Xale) = Na (1.91) 


Each irreducible representation appears in the regular representation a num- 
ber of times equal to its dimension. Note that this is consistent with (1.74). 
Note also that m, is uniquely determined, independent of the basis. 
Example: Back to $4 once more. Let's determine the characters without 
thinking about the 2-dimensional representation explicitly, but knowing the 
conjugacy classes, (e), (a1,a2) and (a3,a4,as5). It is easiest to start with 
the one representation we know every group has — the trivial representa- 
tion, Do for which Do(g) = 1 for all g. This representation has characters 
xo(g) = 1. Note that this is properly normalized. It follows from the con- 
dition 3; n2 = N that the other two representations have dimensions 1 and 
2. It is almost equally easy to write down the characters for the other 1- 
dimensional representation. In general, when there is an invariant subgroup 
H of G, there are representations of G that are constant on H, forming a 
representation of the factor group, G/H. In this case, the factor group is Z2, 
with nontrivial representation 1 for H = (e,a1,a5) and —1 for {a3, a4, a5}. 
We know that for the 2 dimensional representation, xa(e) = n3 = 2, thus so 
far the character table looks like 


a3 
a, | a4 
e | ag} a5 


(1.92) 
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But then we can fill in the last two entries using orthogonality. We could 
actually have just used orthogonality without even knowing about the second 
representation, but using the Z2 makes the algebra trivial. 


a3 
1 a4 
e | a2 as 


(1.93) 


We can use the characters not just to find out how many irreducible rep- 
resentations appear in a particular reducible one, but actually to explicitly 
decompose the reducible representation into its irreducible components. It is 
easy to see that if D is an arbitrary representation, the sum 


P, = $ Y xp. (9)* D(9) (1.94) 
geG 


is a projection operator onto the subspace that transforms under the represen- 
tation a. To see this, note that if we set 7 = k and sum in the orthogonality 
relation (1.69), we find 
x XD. (9)* [Do (g)]em = Sab5em (1.95) 

geG 
Thus when D is written in block diagonal form, the sum in (1.95) gives 1 
on the subspaces that transform like D, and 0 on all the rest — thus it is the 
projection operator as promised. The point, however, is that (1.94) gives us 
the projection operator in the original basis. We did not have to know how to 
transform to block diagonal form. An example may help to clarify this. 
Example — 53 again 

Here's a three dimensional representation of $5 


1 0 0 0 0 1 
Da(e) = (o 1 o) , D3(a1) = ( 0 o) 
0 0 1 0 1 0 
0 1 0 0 1 0 
D3(a2) = ( 0 ] ; D3(a3) = ( 0 o) (1.96) 
1 0 0 00 1 
1 0 0 0 0 1 
D3(a4) = (o 0 i " Ds(as) = t 1 | 
0 1 0 1 0 0 
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More precisely, as usual when we write down a set of matrices to represent 
linear operators, these are matrices which have the same matrix elements — 
that is 

[Ds(g)]jk = G|Ds|k) (1.97) 
One could use a different symbol to represent the operators and the matrices, 
but its always easy to figure out which is which from the context. The impor- 
tant point is that the way this acts on the states, |7) is by matrix multiplication 
on the ides because we can insert a inis set of intermediate states 


= 2H (k|D3(g = 2/9 )[Ds(g (1.98) 


This particular representation is an important one because it is the defin- 
ing representation for the group — it actually implements the permutations 
on the states. For example 


Ds(a1)]1) = DNA) Daas) = |2) 
s(a1)[2) -X» [Ds(a1)]k2  |3) (1.99) 
3(a1)]3) = -Yk [D3(a1)]e3 = |1) 


thus this implements the cyclic transformation (1,2,3), or 1 2 243-1. 
Now if we construct the projection operators, we find 


1 5 1 1 1 1 
P= 6 [Ds P = 3 k 1 j (1.100) 


111 


1 2 5 
P,- 6 e + Y. D3(a;) = 5 2) =0 (1.101) 
j=l 
9 TI E 
P, = P 2D3(e -yp aj) | = 5 -1 2 -1 (1.102) 


-1 -1 2 
This makes good sense. 5 projects onto the invariant combination (|1) + 
[2) + |3))//3, which transforms trivially, while P) projects onto the two 
dimensional subspace spanned by the differences of pairs of components, 
|1} — |2}, etc, which transforms according to D3. 
This constructions shows that the representation D3 decomposes into a 
direct sum of the irreducible representations, 


D3 = Do ® D2 (1.103) 
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1.14 Eigenstates 


In quantum mechanics, we are often interested in the eigenstates of an invari- 
ant hermitian operator, in particular the Hamiltonian, H. We can always take 
these eigenstates to transform according to irreducible representations of the 
symmetry group. To prove this, note that we can divide up the Hilbert space 
into subspaces with different eigenvalues of H. Each subspace furnishes a 
representation of the symmetry group because D(g), the group represen- 
tation on the full Hilbert space, cannot change the H eigenvalue (because 
[D(g),H] — 0). But then we can completely reduce the representation in 
each subspace. 

A related fact is that if some irreducible representation appears only once 
in the Hilbert space, then the states in that representation must be eigenstates 
of H (and any other invariant operator). This is true because H |a, j, £} must 
be in the same irreducible representation, thus 


H |a,j,2) = M, ey la j v) (1.104) 
y 


and if z and y take only one value, then |a, j, x) is an eigenstate. 
This is sufficiently important to say again in the form of a theorem: 


Theorem 1.6 If a hermitian operator, H, commutes with all the elements, 
D(g), of a representation of the group G, then you can choose the eigen- 
states of H to transform according to irreducible representations of G. If 
an irreducible representation appears only once in the Hilbert space, every 
state in the irreducible representation is an eigenstate of H with the same 
eigenvalue. 


Notice that for Abelian groups, this procedure of choosing the H eigen- 
states to transform under irreducible representations is analogous to simulta- 
neously diagonalizing H and D(g). For example, for the group Z» associated 
with parity, it is the statement that we can always choose the H eigenstates to 
be either symmetric or antisymmetric. 

In the case of parity, the linear operator representing parity is hermitian, 
so we know that it can be diagonalized. But in general, while we have shown 
that operators representing finite group elements can be chosen to be unitary, 
they will not be hermitian. Nevertheless, we can show that for an Abelian 
group that commutes with the H, the group elements can simultaneously 
diagonalized along with H. The reason is the following theorem: 


Theorem 1.7 All of the irreducible representations of a finite Abelian group 
are I-dimensional. 
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One proof of this follows from our discussion of conjugacy classes and from 
(1.74). For an Abelian group, conjugation does nothing, because g g/ g^ = 
g' for all g and g’. Therefore, each element is in a conjugacy class all by itself. 
Because there is one irreducible representation for each conjugacy class, the 
number of irreducible representations is equal to the order of the group. Then 
the only way to satisfy (1.74) is to have all of the n;s equal to one. This proves 
the theorem, and it means that decomposing a representation of an Abelian 
group into its irreducible representations amounts to just diagonalizing all the 
representation matrices for all the group elements. 

For a non-Abelian group, we cannot simultaneously diagonalize all of 
the D(g)s, but the procedure of completely reducing the representation on 
each subspace of constant H is the next best thing. 

A classical problem which is quite analogous to the problem of diago- 
nalizing the Hamiltonian in quantum mechanics is the problem of finding the 
normal modes of small oscillations of a mechanical system about a point of 
stable equilibrium. Here, the square of the angular frequency is the eigen- 

„value of the M~!K matrix and the normal modes are the eigenvectors of 
M-!K. In the next three sections, we will work out an example. 


1.15 Tensor products 


We have seen that we can take reducible representations apart into direct 
sums of smaller representations. We can also put representations together 
into larger representations. Suppose that D is an m dimensional representa- 
tion acting on a space with basis vectors |j) for 7 = 1 to m and Dg is ann 
dimensional representation acting on a space with basis vectors |x) for x = 1 
to n. We can make an m x n dimensional space called the tensor product 
space by taking basis vectors labeled by both 7 and z in an ordered pair — 
|j, z). Then when j goes from 1 to m and z goes from 1 to n, the ordered 
pair (7, £) runs over m x n different combinations. On this large space, we 
can define a new representation called the tensor product representation 
D ® D» by multiplying the two smaller representations. More precisely, the 
matrix elements of Dp, @p,(g) are products of those of Dı (g) and D»(g): 


(j, z| Do, @p2(9) |k, y) = (3| Di(g) |k) (| D2(9) |y) (1.105) 


It is easy to see that this defines a representation of G. In general, however, 
it will not be an irreducible representation. One of our favorite pastimes in 
what follows will be to decompose reducible tensor product representations 
into irreducible representations. 
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1.16 Example of tensor products 


Consider the following physics problem. Three blocks are connected by 
springs in a triangle as shown 


(1.106) 


Suppose that these are free to slide on a frictionless surface. What can we 
say about the normal modes of this system. The point is that there is an S5 
symmetry of the system, and we can learn a lot about the system by using the 
symmetry and applying theorem 1.6. The system has 6 degrees of freedom, 
described by the x and y coordinates of the three blocks: 


(z yl 2 yo T3 ys) (1.107) 


This has the structure of a tensor product — the 6 dimensional space is a 
product of a 3 dimensional space of the blocks, and the 2 dimensional space 
of the z and y coordinates. We can think of these coordinates as having two 
indices. It is three two dimensional vectors, 7;, each of the vector indices has 
two components. So we can write the components as r;,, where 7 labels the 
mass and runs from 1 to 3 and y labels the z or y component and runs from 
1 to 2, with the connection 

(zii yl T2 yo 23 V3)— (1.108) 

(riui Ti2 Tz) 122 T31 132) 


The 3 dimensional space transforms under S3 by the representation D3. The 
2 dimensional space transforms by the representation D» below: 


-i x3 n: 
D»(a2) = (À A ; Do(a3) = ( i d , (1.109) 
123.2329 
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This is the same as (1.21). Then, using (1.105), the 6 dimensional represen- 
tation of the coordinates is simply the product of these two representations: 


[De(9)]jukv = [D;(9)];k [D2 (9)]u» (1.110) 
Thus, for example, 
Dv ^. dh o sixes 
Q. 6: 0 — 0 XE <} 
-b -3 0 0 0 0 
De(a)-| 2 2? (1.111) 
is 3 1 9 0 0 0 
0 0 -1-23 9 0 
0 0 3 -i 0 0 


This has the structure of 3 copies of D2(a1) in place of the 1’s in Ds(a). 
The other generators are similar in structure. 

Because the system has the $5 symmetry, the normal modes of the sys- 
tem must transform under definite irreducible representations of the symme- 
try. Thus if we construct the projectors onto these representations, we will 
have gone some way towards finding the normal modes. In particular, if an 
irreducible representation appears only once, it must be a normal mode by 
theorem 1.6. If a representation appears more than once, then we need some 
additional information to determine the modes. 

We can easily determine how many times each irreducible representation 
appears in Dg by finding the characters of Dg and using the orthogonality 
relations. To find the characters of Dg, we use an important general result. 
The character of the tensor product of two representations is the product of 
the characters of the factors. This follows immediately from the definition of 
the tensor product and the trace. 


XDixD2 = XD; XDo (1.112) 


So in this case, 


xe(g) = X [Do(9) juin 
ju (1.113) 


= [D3(9)];j[D2(9)]up = x3(9)x2(9) 
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so that the product is as shown in the table below: 


(1.114) 


This is the same as the characters of the regular representation, thus this rep- 
resentation is equivalent to the regular representation, and contains Dp and 


Dy, once and D» twice. 
Note that (1.113) is an example of a simple but important general relation, 


which we might as well dignify by calling it a theorem — 
Theorem 1.8 The characters of a tensor product representation are the prod- 
ucts of the characters of the factors. 

With these tools, we can use group theory to find the normal modes of 
the system. 


1.17 * Finding the normal modes 


The projectors onto Do and D; will be 1 dimensional. 


Po is 
1 
P = 6 >= xolg)“ De(g) 
geG 
1 v3 —1i YB Q0 -8 
AFI 
42 aes n ae. UU ux (1.115) 
"Rua DE a8 
= 4 12 4 12 6 
MB 4 39 —L Q0 -£ 
12 12 12 12 6 
0 0 0 0 0 0 
AS. lb o 3X8 01 Q 2 
6 6 6 6 3 
l 
2 
v3 
$ 
=| 2 (i od A di -4,) (1.116) 
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corresponding to the motion 


(1.117) 


the so-called "breathing mode" in which the triangle grows and shrinks while 


retaining its shape. 


P is 
1 
P = "PESO Ds(g) 
gEG 
d: 8 14 xX QI 8 
3 i: EE "i Và 
ee 4eXP. edo Wo ig (1.118) 
do £L 1 {xX LIS ọọ 
= 12 12 12 12 6 
3 2l B 1  .yX3 p 
12 4 12 4 6 
ak £ l^ awe 1 qq 
6 6 6 6 3 
0 0 0 0 0 0 
_ v3 
6 
1 
i 
NE (-4 | -@ -1 à 0) (1.119) 
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corresponding to the motion 


(1.120) 


the mode in which the triangle rotates — this is a normal mode with zero 
frequency because there is no restoring force. 

Notice, again, that we found these two normal modes without putting in 
any physics at all except the symmetry! 

Finally, P5 is 


2 
Bez >> x2(g)* De(g) 
geG 
2 0 1l. Ww l v3 
3 6 6 B. 6 
2 3 1 3 1 
0 8 $ 1d -$ I| (1.121) 
H v3 2 0 1 _ v3 
= 6 6 3 6 6 
—YX3 1 0 2 v3 1 
6 6 3 6 6 
1 2x% 1 v3 2 0 
6 6 6 6 3 
v3 1 x43 1 0 2 
6 6 6 6 3 


As expected, this is a rank 4 projection operator (Tr P? = 4). We need some 
dynamical information. Fortunately, two modes are easy to get — translations 
of the whole triangle. 

Translations in the z direction, for example, are projected by 


1 

1021010 

00°00 0 0 

1 1 1 
T; = M E e (1.122) 

303030 

000000 
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and those in the y direction by 


000000 
05 03 0 3 
000000 
M=)o 20102 (1.123) 
000000 
05; 0 3 0 1i 
So the nontrivial modes are projected by 
P — T; —Ty = 
1 0 -l _vB l £B 
3 6 6 6 6 
001 $ a -F d 
3 6 
-1 x% 1 0 -l1 _¥3 (1.124) 
6 6 3 6 6 
-%3 l 0 1 v3 1 
6 6 3 6 6 
-l .Y43 l £B 1 0 
6 6 6 6 3 
43. 1 `X% 1 p 1 
6 6 6 6 3 


To see what the corresponding modes look like, act with this on the vector 
(0 0 0 0 O 1)to get 


£08 3494) (1.125) 


(1.126) 


Then rotating by 27/3 gives a linearly independent mode. 
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1.18 * Symmetries of 2n+1-gons 


This is a nice simple example of a transformation group for which we can 
work out the characters (and actually the whole set of irreducible representa- 
tions) easily. Consider a regular polygon with 2n + 1 vertices, like the 7-gon 
shown below. 


(1.127) 


The group of symmetries of the 2n+1-gon consists of the identity, the 2n 
277 


rotations by 2774 for j = 1to n, 


+2 
rotations by 5 = forj =lton (1.128) 
n 


and the 2n+1 reflections about lines through the center and a vertex, as show 
below: 


reflections about lines through center and vertex (1.129) 


(1.130) 


Thus the order of the group of symmetries is N = 2 x (2n + 1). 
There are n + 2 conjugacy classes: 


1— the identity, e; 
2— the 2n+1 reflections; 


3 ton+2— the rotations by zx) for j — 1 to n — each value of j isa 
separate conjugacy class. 


The way this works is that the reflections are all in the same conjugacy 
class because by conjugating with rotations, you can get from any one reflec- 
tion to any other. The rotations are unchanged by conjugation by rotations, 
but a conjugation by a reflection changes the sign of the rotation, so there is 
a + pair in each conjugacy class. 
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Furthermore, the n conjugacy classes of rotations are equivalent under 
cyclic permutations and relabeling of the vertices, as shown below: 


(1.131) 


(1.132) 


The characters look like 


(1.133) 


Anm 
DEX 2 cos 2 y ul 2 cos mI E 2 cos 2 oT 


In the last line, the different values of m give the characters of the n different 
2-dimensional representations. 


1.19 Permutation group on n objects 


Any element of the permutation group on n objects, called Sp, can be written 
in term of cycles, where a cycle is a cyclic permutation of a subset. We 
will use a notation that makes use of this, where each cycle is written as a 
set of numbers in parentheses, indicating the set of things that are cyclicly 
permuted. For example: 

(1) means z1 — zi 

(1372) means z; — 23 —27;— 29 T1 

Each element of S; involves each integer from I to n in exactly one 
cycle. 

Examples: 

The identity element looks like e —(1)(2)- - (n) — n I-cycles — there is 
only one of these. 
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An interchange of two elements looks like (12)(3)- - (n) — a 2-cycle and 
n — 2 1-cycles — there are n(n — 1)/2 of these — (7172)(73) -> Gn). 
An arbitrary element has k; j-cycles, where 


n 
X jkj=n (1.134) 
j=l 


For example, the permutation (123)(456)(78)(9) has two 3-cycles, 1 2-cycle 
and a 1-cycle, so ky = k2 = 1 and k3 = 2. 

There is an simple (but reducible) n dimensional representation of Sn 
called the defining representation where the “objects” being permuted are 
just the basis vectors of an n dimensional vector space, 


15 125; in) (1.135) 


If the permutation takes x; to xz, the corresponding representation operator 
D takes |j) to |k}, so that 


D |j) = |k) (1.136) 


and thus 
(| D |j) = ore (1.137) 


Each matrix in the representation has a single 1 in each row and column. 


1.20 Conjugacy classes 


The conjugacy classes are just the cycle structure, that is they can be labeled 
by the integers k;. For example, all interchanges are in the same conjugacy 
class — it is enough to check that the inner automorphism ggig~! doesn't 
change the cycle structure of g; when g is an interchange, because we can 
build up any permutation from interchanges. Let us see how this works 
in some examples. In particular, we will see that conjugating an arbitrary 
permutation by the interchange (12)(3)- -- just interchanges 1 and 2 without 
changing the cycle structure 


Examples — (12)(3)(4)-(1)(23)(4)-(2)(3)(4) (note that an interchange 
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is its own inverse) 
1234 


(12)(3)(4) 
(1)(23)(4) 
(12) (3) (4) | (1.138) 


1 
2134 
1 
2314 
} 
3214 
1234 
i 
3214 
(12)(3)(4)-(1)(234)-(12)(3)(4) 

1234 


(2)(13)(4) 


(12) (3) (4) 
(1)(234) 
(12)(3)(4) (1.139) 


1 
2134 
1 
2341 
+ 
3241 
1234 


+ | (2)(134) 
3241 
If 1 and 2 are in different cycles, they just get interchanged by conjuga- 


tion by (12), as promised. 
The same thing happens when | and 2 are in the same cycle. For example 


1234 
L (12)(3)(4) 
2134 
l (123)(4) 
1394 
L (12)(3)(4) (1.140) 
3124 
1234 u 
+ (213)(4) 
3124 


Again, in the same cycle this time, 1 and 2 just get interchanged. 

Another way of seeing this is to notice that the conjugation is analo- 
gous to a similarity transformation. In fact, in the defining, n dimensional 
representation of (1.135) the conjugation by the interchange (12) is just a 
change of basis that switches |1) + |2). Then it is clear that conjugation 
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does not change the cycle structure, but simply interchanges what the permu- 
tation does to 1 and 2. Since we can put interchanges together to form an 
arbitrary permutation, and since by repeated conjugations by interchanges, 
we can get from any ordering of the integers in the given cycle structure to 
any other, the conjugacy classes must consist of all possible permutations 
with a particular cycle structure. 

Now let us count the number of group elements in each conjugacy class. 
Suppose a conjugacy class consists of permutations of the form of kı 1- 
cycles, ka 2-cycles, etc, satisfying (1.134). The number of different permuta- 
tions in the conjugacy class is 


n! 
II ji kj! 


because each permutation of number 1 to n gives a permutation in the class, 
but cyclic order doesn't matter within a cycle 


(1.141) 


(123) is the same as (231) (1.142) 
and order doesn't matter at all between cycles of the same length 


(12)(34) is the same as (34)(12) (1.143) 


1.231 Young tableaux 


It is useful to represent each 7-cycle by a column of boxes of length 7, top- 
justified and arranged in order of decreasing 7 as you go to the right. The 
total number of boxes is n. Here is an example: 


[LLL] (1.144) 


is four 1-cycles in S4 — that is the identity element — always a conjugacy 
class all by itself. Here’s another: 


(1.145) 


is a 4-cycle, a 3-cycle and a 1-cycle in Sg. These collections of boxes are 
called Young tableaux. Each different tableaux represents a different conju- 
gacy class, and therefore the tableaux are in one-to-one correspondence with 
the irreducible representations. 
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1.22 Example — our old friend 5; 


The conjugacy classes are 


[EE H- H (1.146) 


with numbers of elements 


3! 3! 3! 
Reger — — 1.14 
3! a 2 ? 3 i AFO 


1.23 Another example — 54 


on FPG Bom 


with numbers of elements 


! ! ! 4! 4! 
fj used oec (1.149) 


4 4 8 3 4 


The characters of 54 look like this (with the conjugacy classes which 
label the columns in the same order as in (1.148)): 


conjugacy classes 


(1.150) 


The first row represents the trivial representation. 


1.24 * Young tableaux and representations of 5, 


We have seen that a Young tableau with n boxes is associated with an irre- 
ducible representation of S. We can actually use the tableau to explicitly 
construct the irreducible representation by identifying an appropriate sub- 
space of the regular representation of Sp. 
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To see what the irreducible representation is, we begin by putting the 
integers from 1 to n in the boxes of the tableau in all possible ways. There 
are n! ways to do this. We then identify each assignment of integers 1 to 
n to the boxes with a state in the regular representation of Sn by defining 
a standard ordering, say from left to right and then top down (like reading 
words on a page) to translate from integers in the boxes to a state associated 
with a particular permutation. So for example 


6151312] (1.151) 
— |6532174) 
4 


where |6532174) is the state corresponding to the permutation 
1234567 — 6532174 (1.152) 


Now each of the n! assignment of boxes to the tableau describes one of the 
n! states of the regular representation. 

Next, for a particular tableau, symmetrize the corresponding state in the 
numbers in each row, and antisymmetrize in the numbers in each column. For 
example 


— |12} + |21) (1.153) 


and 
— |123) + [213) — [321) — |231) PR 

Now the set of states constructed in this ways spans some subspace of 
the regular representation. We can construct the states explicitly, and we 
know how permutations act on these states. That the subspace constructed 
in this way is a representation of Sn, because a permutation just corresponds 
to starting with a different assignment of numbers to the tableau, so acting 
with the permutation on any state in the subspace gives another state in the 
subspace. In fact, this representation is irreducible, and is the irreducible 
representation we say is associated with the Young tableau. 

Consider the example of S3. The tableau 


QL] (1.155) 


gives completely symmetrized states, and so is associated with a one dimen- 
sional subspace that transforms under the trivial representation. The tableau 


H (1.156) 
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gives completely antisymmetrized states, and so, again is associated with a 
one dimensional subspace, this time transforming under the representation in 
which interchanges are represented by —1. Finally 


m (1.157) 


gives the following states: 


— |123) + |213) — |321) — |231) (1.158) 
— |321) + |231) — |123) — |213) (1.159) 
— |231} + |321) — |132) — |312) (1.160) 
— |132) + |312) — |231) — |321) (1.161) 
— |312) + |132) — |213) — |123) (1.162) 

(1.163) 


— |213) + |123} — |312) — |132) 


Note that interchanging two numbers in the same column of a tableau just 
changes the sign of the state. This is generally true. Furthermore, you can 
see explicitly that the sum of three states related by cyclic permutations van- 
ishes. Thus the subspace is two dimensional and transforms under the two 
dimensional irreducible representation of 55. 

It turns out that the dimension of the representation constructed in this 
way is l 

n! 

H (1.164) 
where the quantity H is the “hooks” factor for the Young tableau, computed 
as follows. A hook is a line passing vertically up through the bottom of 
some column of boxes, making a right hand turn in some box and passing out 
through the row of boxes. There is one hook for each box. Call the number 
of boxes the hook passes through A. Then H is the product of the hs for 
all hooks. We will come back to hooks when we discuss the application of 
Young tableaux to the representations of SU(N) in chapter XIII. 

This procedure for constructing the irreducible representations of S, is 
entirely mechanical (if somewhat tedious) and can be used to construct all the 
representations of Sn from the Young tableaux with n boxes. 
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We could say much more about finite groups and their representations, 
but our primary subject is continuous groups, so we will leave finite groups 
for now. We will see, however, that the representations of the permutation 
groups play an important role in the representations of continuous groups. So 
we will come back to S» now and again. 


Problems 


1.A. Find the multiplication table for a group with three elements and 
prove that it is unique. 


1.B. Find all essentially different possible multiplication tables for groups 
with four elements (which cannot be related by renaming elements). 


1.C. Show that the representation (1.135) of the permutation group is 
reducible. 


1.D. Suppose that D, and D» are equivalent, irreducible representations 
of a finite group G, such that 


D2(g) = SDi(g)S* Vg EG 
What can you say about an operator A that satisfies 


AD,(g) = Do(g)A Vg € G? 


1.E. Find the group of all the discrete rotations that leave a regular tetra- 
hedron invariant by labeling the four vertices and considering the rotations 
as permutations on the four vertices. This defines a four dimensional repre- 
sentation of a group. Find the conjugacy classes and the characters of the 
irreducible representations of this group. 


*1.F. Analyze the normal modes of the system of four blocks sliding on 
a frictionless plane, connected by springs as shown below: 
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just as we did for the triangle, but using the 8-element symmetry group of 
the square. Assume that the springs are rigidly attached to the masses (rather 
than pivoted, for example), so that the square has some rigidity. 


Chapter 2 


Lie Groups 


Suppose our group elements g € G depend smoothly on a set of continuous 
parameters — 


g(a) (2.1) 


What we mean by smooth is that there is some notion of closeness on the 
group such that if two elements are “close together” in the space of the group 
elements, the parameters that describe them are also close together. 


2. Generators 


Since the identity is an important element in the group, it is useful to param- 
eterize the elements (at least those close to the identity element) in such a 
way that œ = 0 corresponds to the identity element. Thus we assume that in 
some neighborhood of the identity, the group elements can be described by a 
function of N real parameters, a, for a = 1 to N, such that 


gla)la=0 = € Q.2) 


Then if we find a representation of the group, the linear operators of the rep- 
resentation will be parameterized the same way, and 


D(a)l, y —1 (2.3) 


Then in some neighborhood of the identity element, we can Taylor expand 
D(a), and if we are close enough, just keep the first term: 


D(da) = 1+ ida,X, +: (2.4) 


43 DOI: 10.1201/97804294992 10-3 
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where we have called the parameter da to remind you that it is infinitesimal. 
In (2.4), a sum over repeated indices is understood (the “Einstein summation 
convention") and 


i D(o) (2.5) 


Xqg=-1 
OO a=0 


The X, fora = 1 to N are called the generators of the group. If the 
parameterization is parsimonious (that is — all the parameters are actually 
needed to distinguish different group elements), the X, will be independent. 
The i is included in the definition (2.5) so that if the representation is unitary, 
the X, will be hermitian operators. 

Sophus Lie showed how the generators can actually be defined in the 
abstract group without mentioning representations at all. As a result of his 
work, groups of this kind are called Lie groups. I am not going to talk about 
them this way because I am more interested in representations than in groups, 
but it is a beautiful theoretical construction that you may want to look up if 
you haven't seen it. 

As we go away from the identity, there is enormous freedom to param- 
eterize the group elements in different ways, but we may as well choose our 
parameterization so that the group multiplication law and thus the multipli- 
cation law for the representation operators in the Hilbert space looks nice. 
In particular, we can go away from the identity in some fixed direction by 
simply raising an infinitesimal group element 


D(do) = 1 + ida, X, (2.6) 


to some large power. Because of the group property, this always gives an- 
other group element. This suggests defining the representation of the group 
elements for finite a as 


D(a) = lim (1 ios X, [ky Sees (2.7) 


In the limit, this must go to the representation of a group element because 
1 + ia, X,_/k becomes the representation of a group element in (2.4) as k 
becomes large. This defines a particular parameterization of the representa- 
tions (sometimes called the exponential parameterization), and thus of the 
group multiplication law itself. In particular, this means that we can write the 
group elements (at least in some neighborhood of e) in terms of the genera- 
tors. That’s nice, because unlike the group elements, the generators form a 
vector space. They can be added together and multiplied by real numbers. In 
fact, we will often use the term generator to refer to any element in the real 
linear space spanned by the Xas. 
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2.2 Lie algebras 


Now in any particular direction, the group multiplication law is uncompli- 
cated. There is a one parameter family of group elements of the form 


Uy cens (2.8) 
and the group multiplication law is simply 
U(A1)U 02) = U (ài + 22) (2.9) 


However, if we multiply group elements generated by two different linear 
combinations of generators, things are not so easy. In general, 


eitaXa eiP Xo # etla tba) Xa (2.10) 


On the other hand, because the exponentials form a representation of the 
group (at least if we are close to the identity), it must be true that the product 
is some exponential of a generator, 


gie Xa pipt Xs = etla X« (2.11) 
for some ó. And because everything is smooth, we can find 6, by expanding 
both sides and equating appropriate powers of o and 8. When we do this, 
something interesting happens. We find that it only works if the generators 


form an algebra under commutation (or a commutator algebra). To see this, 
let's actually do it to leading nontrivial order. We can write 


ió, X, — In (1 4 gita Xa gil Xy... 1) (2.12) 


I will now expand this, keeping terms up to second order in the parameters o 
and £, using the Taylor expansion of In(1 + K) where 
K = eitaXa gif Xy _ 1 
= (1 + io4X, — 5 (o4 X? +--+) 
(1 + iBy Xy — E(X)? Fiel (2.13) 
= ada + iBo Xa — aa XaBy Xy 


-3(e«X4)* p 3 (AX spore 


46 CHAPTER 2. LIE GROUPS 


This gives 
ió X, = K — EK? 4. 
= 10g X, + iB. Xa v Qa Xa B X y 


2.14 
- lax, mi 5(BaXa)? 


+5 (aKa + f Xa? E REA 


Now here is the point. The higher order terms in (2.14) are trying to cancel. 
If the Xs were numbers, they would cancel, because the product of the ex- 
ponentials is the exponential of the sum of the exponents. They fail to cancel 
only because the Xs are linear operators, and don't commute with one an- 
other. Thus the extra terms beyond £o X, 4- 20, X, in (2.14) are proportional 
to the commutator. Sure enough, explicit calculation in (2.14) gives 


1 
it X, = K - 5K? + 


= dog Xa + if Xa (2.15) 


1 
79$ [fa Xa; By X; desse 


We obtained (2.15) using only the group property and smoothness, which 
allowed us to use the Taylor expansion. From (2.15) we can calculate do, 
again in an expansion in o and 8. We conclude that 


[Oa Xa; By Xi] = —2i(ó. — Qe — Bc) Xe Tm VycXc (2.16) 


where the 2 is put in to make y real and the - - - represent terms that have more 
than two factors of o or B. Since (2.16) must be true for all a and 8, we must 
have 


Yo = Car fabe (2.17) 
for some constants fabe, thus 
[Xa, Xb] = i fabeXc- (2.18) 
where 
fabe = — fbac (2.19) 
because [A, B] = —[B, A]. Note that we can now write 
1 
fa = Qa + pa ye (2.20) 


2 
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so that if y and the higher terms vanish, we would restore the equality in 
(2.10). 

(2.18) is what is meant by the statement that the generators form an alge- 
bra under commutation. We have just shown that this follows from the group 
properties for Lie groups, because the Lie group elements depend smoothly 
on the parameters. The commutator in the algebra plays a role similar to the 
multiplication law for the group. 

Now you might worry that if we keep expanding (2.12) beyond second 
order, we would need additional conditions to make sure that the group mul- 
tiplication law is maintained. The remarkable thing is that we don't. The 
commutator relation (2.18) is enough. In fact, if you know the constants, 
faic, you can reconstruct 6 as accurately as you like for any a and fj in some 
finite neighborhood of the origin! Thus the fabc are tremendously important 
— they summarize virtually the entire group multiplication law. The fa. are 
called the structure constants of the group. They can be computed in any 
nontrivial representation, that is unless the Xa vanish. 

The commutator relation (2.18) is called the Lie algebra of the group. 
The Lie algebra is completely determined by the structure constants. Each 
group representation gives a representation of the algebra in an obvious way, 
and the structure constants are the same for all representations because they 
are fixed just by the group multiplication law and smoothness. Equivalence, 
reducibility and irreducibility can be transferred from the group to the algebra 
with no change. 

Note that if there is any unitary representation of the algebra, then the 
Jabes are real, because if we take the adjoint of the commutator relation for 
hermitian Xs, we get 


[Xa, XJ! =- faicXc 


; . (2.21) 
= [Xo, Xa] = i fbacXe = —i fabeXe 


Since we are interested in groups which have unitary representations, we will 
just assume that the fabe are real. 


2.3 The Jacobi identity 
The matrix generators also satisfy the following identity: 


[Xa, [X», X.]] + cyclic permutations = 0. (2.22) 
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called the Jacobi identity, which you can check by just expanding out the 
commutators. ! 

The Jacobi identity can be written in a different way that is sometimes 
easier to use and is also instructive: 


(Xa; (Xo, X.]] = ([Xa, Xi] Xe] + [X», (Xa, Xel] d (2.23) 
This is a generalization of the product rule for commutation: 
(Xa, Xy Xc] = [Xa, Xi] Xc + Xo | Xo, Xe]. (2.24) 


The Jacobi identity is rather trivial for the Lie algebras with only finite dimen- 
sional representations that we will study in this book. But it is worth noting 
that in Lie's more general treatment, it makes sense in situations in which the 
product of generators is not even well defined. 


2.4 The adjoint representation 


The structure constants themselves generate a representation of the algebra 
called the adjoint representation. If we use the algebra(2.18), we can com- 
pute 

(Xa; [X», Xel] 


= i foca (Xa, Xa] (2.25) 
E — focafade Xe 
so (because the X, are independent), 2.22) implies 
Toca fade + fata f cde + fead fode =0. (2.26) 
Defining a set of matrices Ta 
[Taloc = —tfabe (2.27) 
then (2.26) can be rewritten as 
(Ta, Tij = i fabcTe (2.28) 


Thus the structure constants themselves furnish a representation of the alge- 
bra. This is called the adjoint representation. The dimension of a repre- 
sentation is the dimension of the linear space on which it acts (just as for a 


'The Jacobi identity is really more subtle than this. We could have proved it directly in 
the abstract group, where the generators are not linear operators on a Hilbert space. Then 
the algebra involves a "Lie product" which is not necessarily a commutator, but nevertheless 
satisfies the Jacobi identity. 
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finite group). The dimension of the adjoint representation is just the number 
of independent generators, which is the number of real parameters required 
to describe a group element. Note that since the fabes are real, the generators 
of the adjoint representation are pure imaginary. 

We would like to have a convenient scalar product on the linear space 
of the generators in the adjoint representation, (2.27), to turn it into a vector 
space. A good one is the trace in the adjoint representation 


eX (7,1) (2.29) 


This is a real symmetric matrix. We will next show that we can put it into 
a very simple canonical form. We can change its form by making a linear 
transformation on the Xa, which in turn, induces a linear transformation on 
the structure constants. Suppose 


Xa > X; = LabXo (2.30) 
then 
[Xa Xi] = i LaaLiefaecXc 
=í LoaLie faeg Ln LincXe (2.31) 
=í LoaLoefaeg Lze Xe 
so? 
fave = fabe = LadLiefdeg Lze (2.32) 
If we then define a new Tas with the transformed fs, 
[Talc => alte = LoaLse[Ta]eg Lze (2.33) 
or 
[Ta] > [Ta] = LoaL(Ta]L (2.34) 


In other words, a linear transformation on the Xas induces a linear transfor- 
mation on the Tas which involves both a similarity transformation and the 
same linear transformation on the a index that labels the generator. But in the 
trace the similarity transformation doesn't matter, so 


Tr(1511) = Tr (7715) = LocLba Tr(T.Ta) (2.35) 


Because of the L^! in (2.32), it would be make sense to treat the third index in Jabe 
differently, and write it as an upper index — f*,. We will not bother to do this because we are 
going to move very quickly to a restricted set of groups and basis sets in which Tr(T, Tẹ) œ 
dab. Then only orthogonal transformation on the Xas are allowed, L^! = LT, so that all 
three indices are treated in the same way. 
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Thus we can diagonalize the trace by choosing an appropriate L (here we 
only need an orthogonal matrix). Suppose we have done this (and dropped 
the primes), so that 


Tr(TaTe) = k^ó,y no sum (2.36) 


We still have the freedom to rescale the generators (by making a diagonal 
L transformation), so for example, we could choose all the non-zero k*s to 
have absolute value 1. But, we cannot change the sign of the k*s (because L 
appears squared in the transformation (2.35)). 

For now, we will assume that the k^s are positive. This defines the 
class of algebras that we study in this book. They are called the compact 
Lie algebras. We will come back briefly below to algebras in which some 
are zero.? And we will take 


Tr(TnTh) = À fab (2.37) 


for some convenient positive A. In this basis, the structure constants are com- 
pletely antisymmetric, because we can write 


fuc = —id~* Tr([Ta, To] To) (2.38) 
which is completely antisymmetric because of the cyclic property of the trace. 


Tr([To, Te] Te) = (To T Te = ToTaTe) 


= (ToTeTa — TIVI) = Tr((T%, Te] Ta) (2.39) 


which implies 
fabe = foca - (2.40) 


Taken together, (2.19) and (2.40) imply the complete antisymmetry of fabe 
fate = foca = "n (2.41) 
= — fbac = — facb = — feba - 


In this basis, the adjoint representation is unitary, because the T, are imagi- 
nary and antisymmetric, and therefore hermitian. 


Algebras in which some of the kas are negative have no nontrivial finite dimensional 
unitary representations. This does not mean that they are not interesting (the Lorentz group is 
one such), but we will not discuss them. 
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2.5 Simple algebras and groups 


An invariant subalgebra is some set of generators which goes into itself 
under commutation with any element of the algebra. That is, if X is any 
generator in the invariant subalgebra and Y is any generator in the whole al- 
gebra, [Y, X] is a generator in the invariant subalgebra. When exponentiated, 
an invariant subalgebra generates an invariant subgroup. To see this note that 


h=6¥, g= (2.42) 
g hg = è” (2.43) 
where 
I —iY iY : 1 
X =e Xe =X—il¥,X]- 5I I, X] e (2.44) 


Note that the easy way to see this is to consider 
X'(e) = eY x eieY (2.45) 


then Taylor expand in e and set € = 1. Each derivative brings another com- 
mutator. Evidently, each of the terms in X' is in the subalgebra, and thus eX 
is in the subgroup, which is therefore invariant. 

The whole algebra and 0 are trivial invariant subalgebras. An algebra 
which has no nontrivial invariant subalgebra is called simple. A simple alge- 
bra generates a simple group. 

The adjoint representation of a simple Lie algebra satisfying (2.37) is 
irreducible. To see this, assume the contrary. Then there is an invariant sub- 
space in the adjoint representation. But the states of the adjoint representation 
correspond to generators, so this means that we can find a basis in which the 
invariant subspace is spanned by some subset of the generators, T, for r = 1 
to K. Call the rest of the generators Ty for r = K + 1 to N. Then because 
the rs span an invariant subspace, we must have 


(Tolar =-1 fazr = 0 (2.46) 


for all a, z and r. Because of the complete antisymmetry of the structure 
constants, this means that all components of f that have two rs and one z or 
two zs and one r vanish. But that means that the nonzero structures constants 
involve either three rs or three zs, and thus the algebra falls apart into two 
nontrivial invariant subalgebras, and is not simple. Thus the adjoint represen- 
tation of a simple Lie algebra satisfying (2.37) is irreducible. 
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We will often find it useful to discuss special Abelian invariant subalge- 
bras consisting of a single generator which commutes with all the generators 
of the group (or of some subgroup we are interested in). We will call such 
an algebra a U(1) factor of the group. U(1) is the group of phase transfor- 
mations. U(1) factors do not appear in the structure constants at all. These 
Abelian invariant subalgebras correspond to directions in the space of gener- 
ators for which ka = 0 in (2.36). If X, isa U(1) generator, fabe = 0 for all b 
and c. That also means that the corresponding X^ is zero, so the trace scalar 
product does not give a norm on the space. The structure constants do not tell 
us anything about the U (1) subalgebras. 

Algebras without Abelian invariant subalgebras are called semisimple. 
They are built, as we will see, by putting simple algebras together. In these 
algebras, every generator has a non-zero commutator with some other gen- 
erator. Because of the cyclic property of the structure constants, (2.38), this 
also implies that every generator is a linear combination of commutators of 
generators. In such a case, the structure constants carry a great deal of in- 
formation. We will use them to determine the entire structure of the algebra 
and its representations. From here on, unless explicitly stated, we will dis- 
cuss semisimple algebras, and we will deal with representations by unitary 
operators. 


2.6 States and operators 


The generators of a representation (like the elements of the representations 
they generate) can be thought of as either linear operators or matrices, just as 
we saw when we were discussing representations of finite groups — 


Xali) = 61X«li = |9)[Xalyi (2.47) 


with the sum on j understood. As in (1.98), the states form row vectors and 
the matrix representing a linear operator acts on the right. 

In the Hilbert space on which the representation acts, the group elements 
can be thought of as transformations on the states. The group element e*@+*« 
maps or transforms the kets as follows: 


li) 2 |’) = eo. (2.48) 
Taking the adjoint shows that the corresponding bras transform as 


(| > (i| = (ie? , (2.49) 
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The ket obtained by acting on |i) with an operator O is a sum of kets, and 
therefore must also transform as in (2.48). 


Oli) > e^«^« 0|) 


= ettaXa Qe 10s Xo giae Xa |) = O' i") : (2.50) 
This implies that any operator O transforms as follows: 
OO = dion) s ger aka, Q.51) 


The transformation leaves all matrix elements invariant. 
The action of the algebra on these objects is related to the change in the 
state of operator under an infinitesimal transformation. 


—ió|i) = —i((1 + ios X4)|i) — |é)) = aa Xai) (2.52) 
—ió(i| = —(ilos X, (2.53) 
—i6O = [o4 X,,O] . (2.54) 


Thus, corresponding to the action of the generator X, on a ket 
Xali) (2.55) 


is — X, acting on a brat 
X (2.56) 


and the commutator of X with an operator 
[Xq, O] . (2.57) 
Then the invariance of a matrix element (i|O|7) is expressed by the fact, 


(ilO (Xali)) + (il [Xa, O] ft) — (a1 Xa) Oli) = 0. (2.58) 


2.7 Fun with exponentials 


Consider the exponential 
gita Xa (2.59) 


“The argument above can be summarized by saying that the minus signs in (2.56) and in 
the commutator in (2.57) come ultimately from the unitarity of the transformation, (2.48). 
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where Xa is a representation matrix. We can always define the exponential 
as a power series, 

dos. yo Gne" (2.60) 

beri n! 

However, it is useful to develop some rules for dealing with these things with- 
out expanding, like our simple rules for exponentíals of commuting num- 
bers. We have already seen that the multiplication law is not as simple as 
just adding the exponents. You might guess that the calculus is also more 
complicated. In particular, 


dex, A iX, eic Xa (2.61) 
Qo 


However, it is true that 


d j ; T 

q,9 = day X, etes = jeltte aX, (2.62) 
S 

because o X, commutes with itself. This is very important, because you can 

often use it to derive other useful results. It is also true that 


z— eé9X — dX, 2.63 
Oo; n b (2.63) 
because this can be shown directly from the expansion. It is occasionally 
useful to have a general expression for the derivative. Besides, it is a beautiful 
formula, so I will write it down and tell you how to derive it. The formula is 
a. AE l l 
LL ¢idaXa = n ds gi59« Xa (Xp) el(l—s)acXe (2.64) 
Oap 0 
I love this relation because it is so nontrivial, yet so easy to remember. The 
integral just expresses the fact that the derivative may act anywhere “inside” 
the exponential, so the result is the average of all the places where the deriva- 
tive can act. One way of deriving this is to define the exponential as a limit 
as in (2.7). 
ei?«X« — lim (1 + iagXo/k)* (2.65) 
k= 


and differentiate both sides — the result (2.64) is then just an exercise in 
defining an integral as a limit of a sum. Another way of doing it is to expand 
both sides and use the famous integral 


min! 
i ds s” (1 — s) = GRE (2.66) 


We will see other properties of exponentials of matrices as we go along. 


2.7. FUN WITH EXPONENTIALS 
Problems 
2.A. Find all components of the matrix e24 where 
0 0 1 
A={0 0 0 
100 


2.B. If [A, B] = B, calculate 


elaA B PR 


55 


2C. Carry out the expansion of 6, in (2.11) and (2.12) to third order in 


o and f (one order beyond what is discussed in the text). 


Chapter 3 


SU(2) 


The SU (2) algebra is familiar.! 
[Jj; Jk] = tejneJe (3.1) 


This is the simplest of the compact Lie algebras because e;; for i, j,k = 
1 to 3 is the simplest possible completely antisymmetric object with three in- 
dices. (3.1) is equivalent (in units in which A = 1) to the angular momentum 
algebra that you studied in quantum mechanics. In fact we will only do two 
things differently here. One is to label the generators by 1, 2 and 3 instead 
of z, y and z. This is obviously a great step forward. More important is the 
fact that we will not make any use of the operator JaJa. Initially, this will 
make the analysis slightly more complicated, but it will start us on a path that 
generalizes beautifully to all the other compact Lie algebras. 


3.1 J; eigenstates 


Our ultimate goal is to completely reduce the Hilbert space of the world to 
block diagonal form. To start the process, let us think about some finite space, 
of dimension N, and assume that it transforms under some irreducible repre- 
sentation of the algebra. Then we can see what the form of the algebra tells 
us about the representation. Clearly, we want to diagonalize as many of the 
elements of the algebra as we can. In this case, since nothing commutes with 
anything else, we can only diagonalize one element, which we may as well 
take to be J3. When we have done that, we pick out the states with the highest 
value of J3 (we can always do that because we have assumed that the space 


! We will see below why the name SU(2) is appropriate. 
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is finite dimensional). Call the highest value of J3 7. Then we have a set of 


States 
Ja]7, a) = j|j, a) (3.2) 


where a is another label, only necessary if there is more than one state of 
highest Jz (of course, you know that we really don't need o because the 
highest state is unique, but we haven't shown that yet, so we will keep it). We 
can also always choose the states so that 


(j, alj, 8) = bag (3.3) 


3.2 Raising and lowering operators 


Now, just as in introductory quantum mechanics, we define raising and low- 
ering operators, 


JE = (A t iJ2)/ V2 (3.4) 
satisfying 
[J3, J*] = +5* (3.5) 
[J+, J7] eed (3.6) 
so they raise and lower the value of J3 on the states. If 
Ja|m) = m|m) (3.7) 
then 
J3J*|m) = JE? Jm) + J*|m) = (m +1) J*|m) (3.8) 


The key idea is that we can use the raising and lowering operators to 
construct the irreducible representations and to completely reduce reducible 
representations. This idea is very simple for SU(2), but it is very useful to 
see how it works in this simple case before we generalize it to an arbitrary 
compact Lie algebra. 

There is no state with J3=7+1 because we have assumed that j is the 
highest value of J3. Thus it must be that 


J*|j;,o) =0Va (3.9) 


because any non-zero states would have J3=7+1. The states obtained by act- 
ing with the lowering operator have J3=7—1, so it makes sense to define 


J 1j, o) = N;(a)|j — 1,0) (3.10) 
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where N;(o) is a normalization factor. But we easily see that states with 
different o are orthogonal, because 


N;(B)' N;(a)G ~ 1, 8lj — 1,0) 

= (j, lJ} JT |j, a) 

= (j, B| [J*, JT] |j o) (3.11) 
= (j, 8|Jalj; o) 

= j(7, plj a) = j bag 


Thus we can choose the states |j — 1, œ} to be orthonormal by choosing 
Nj(o) = vj =N; (3.12) 
Then in addition to (3.10), we have 


. 1 E 
J*[ EE 1,0) > xb 1j; o) 
1 J 
= 7, VT] (3.13) 
J 
‘ar : 
= N; ^o = Nj|j, o) 


The point is that because of the algebra, we can define the states so that 
the raising and lowering operators act without changing o. That is why the 
parameter a is eventually going to go away. Now an analogous argument 
shows that there are orthonormal states |j — 2, a) satisfying 


J B S 1, a) = Nj Mi 2, a) 
3.14 
Feza capi e, a) SM 


Continuing the process, we find a whole tower of orthonormal states, |j — 
k, a) satisfying 
J|j-ko)N;.xj-k-l, 
lj — ko) = Nj-lj a) (3.15) 
Su hs 1, a) = Njx|j m k, a) 


The Ns can be chosen to be real, and because of the algebra, they satisfy 
Ni, —G6-hoJ*J-|j - k a) 

= (j 7 k, a| [J*, J] lj = k, a) 

TG m k, o|J- J*|j zd k, a) 

=N FT tj-k 


(3.16) 
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This is a recursion relation for the Ns which is easy to solve by starting with 
N:: 
j^ 


N? =j 
N, — N? -j-1 
Ni, E N? 44 =j-k (ED 


N2, = (k+1)j—k(k+1)/2 
= 4(k+1)(2j-k) 


or setting k = j —m 


Neve E CEU -AE (3.18) 
Because the representation is finite dimensional (by assumption — we haven't 
proved this) there must be some maximum number of lowering operators, £, 
that we can apply to |j, à). We must eventually come to some m = j — £ such 
that applying any more lowering operators gives 0. Then £ is a non-negative 
integer specifying the number of times we can lower the states with highest 
J3. Another lowering operator annihilates the state — 


J^|j - £&a) «0. (3.19) 


But then the norm of J^ |j — £, œ} must vanish, which means that 


Nja- Jai - 001 =0 (3.20) 


the factor £ + 1 cannot vanish, thus we must have 
£22j. (3.21) 


Thus 


j- a8 : 
j= 5 for some integer £. (3.22) 


Now we can get rid of o. It is now clear that the space breaks up into 
subspaces that are invariant under the algebra, one for each value of a, be- 
cause the generators do not change o. Thus from our original assumption of 
irreducibility, there must be only one a value, so we can drop the a entirely. 
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Furthermore, there can be no other states, or the subspace we just constructed 
would be nontrivial (and invariant). Thus we have learned how the generators 
act on all the finite dimensional irreducible representations. In fact, though 
we won't prove it, there are no others — that is all representations are finite 
dimensional, so we know all of them. 


3.3 The standard notation 


We can now switch to the standard notation in which we label the states of 
the irreducible representations by the highest J3 value in the representation 
and the J3 value? 


|j, m) (3.23) 


and the matrix elements of the generators are determined by the matrix ele- 
ments of Jz and the raising and lowering operators, J*:3 


(j, m'|Js|j, m) =m Óm'm 
(jm |J ljm) = JG +m + 1)(j —m)/2 m m+ (3.24) 


(j;m'|J^ |j; m) = y (j + m)G — m + 1)/2 m m-1 


These matrix elements define the spin j representation of the SU (2) algebra: 
[Jilke = (Jj c 1— kal i + 1-2) (3.25) 


Here we have written the matrix elements in the conventional language where 
the rows and columns are labeled from 1 to 27 + 1. In this case, it is often 
convenient to label the rows and columns directly by their m values, which 
are just 7 + 1 — Z and j + 1 — k above in (3.25). In this notation, (3.25) would 
read 


[J2]pm = (G, m'|Ja|j, m) (3.26) 


where m and m’ run from j to —j in steps of —1. We will use these in- 
terchangeably — choosing whichever is most convenient for the problem at 
hand. 


?^well, not completely standard — in some books, including the first edition of this one, 
the j and m are written in the other order. 

>The /2 factors are the result of our definition of the raising and lowering operators and 
are absent in some other treatments. 
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For example, for 1/2, this gives the spin 1/2 representation 


241 0 2 
2_1/0 —Y ll (3.27) 
J2 =5 (5 3) = 5% 
ie deel 0)\_1 
Fe n 


0 1 0 -—i re 0 
asli » n-(1 a) n - (4 e (3.28) 
satisfying 
Oadb = Óqp + i€apcOc (3.29) 
The spin 1/2 representation is the simplest representation of SU(2). It is 
called the "defining" representation of SU (2), and is responsible for the name 
SU, which is an acronym for "Special Unitary". Exponentiating the gener- 


ators of the spin 1/2 representation to get the representation of finite group 
elements gives matrices of the form 


gi 4/2 (3.30) 


which are the most general 2 x 2 unitary matrices with determinant 1. The 
"special", in Special Unitary means that the determinant is 1, rather than an 
arbitrary complex number of absolute value 1. 

All the other irreducible representations can be constructed similarly. For 
example, the spin 1 representation looks like 


1 ( 1 j 
Ji=—=(101 
v2 0 1 0 


0 -i 0 
es (i 0 3 (3.31) 
V2\o i4 0 
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while the spin 3/2 representation is 


o yi o o 
gira "H 0 2 0 
: 0 2 0 Ji 
0 0 Jio 
0 -yi o 0 
pe |v 0 -a o (3.32) 
? 0 % oO i 
0 0 à 0 
20 0 0 
pe —| 9 i 0 0 
3 0-0 42 c0 
00 0 -3 


The construction of the irreducible representations above generalizes to 
any compact Lie algebra, as we will see. The J3 values are called weights, 
and the analysis we have just done is called the highest weight construc- 
tion because it starts with the unique highest weight of the representation. 
Note that the same construction provides a systematic procedure for bringing 
an arbitrary finite dimensional representation into block diagonal form. The 
procedure is as follows: 


1. Diagonalize J3. 
2. Find the states with the highest J3 value, 7. 


3. For each such state, explicitly construct the states of the irreducible 


spin j representation by applying the lowering operator to the states 
with highest J3. 


4. Now set aside the subspace spanned by these representations, which 
is now in canonical form, and concentrate on the subspace 
orthogonal to it. 


5. Take these remaining states, go to step 2 and start again with the 
states with next highest J3 value. 
(3.33) 
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The end result will be the construction of a basis for the Hilbert space of the 
form 
[j, m, o) (3.34) 


where m and j refer to the J3 value and the representation as usual (as in 
(3.23) and o refers to all the other observables that can be diagonalized to 
characterize the state. These satisfy 


(^, m', af |j, m, a) = Omm jj bala (3.35) 


The Kronecker 5s are automatic consequences of our construction. They are 
also required by Schur’s lemma, because the matrix elements satisfy 


us m', e |Ja |J, m, a) 
= [Jum Gm", o|j, m, o) (3.36) 


= (',m', o |j, m", o) [Jd] mm 


because we can insert a complete set of intermediate states on either side of 
Ja. Thus (j',m', o |j, m, a) commutes with all the elements of an irre- 
ducible representation, and is either 0 if j 4 ;' or proportional to the identity, 
Óm'm if j =J 5 


3.4 Tensor products 


You have probably all used the highest weight scheme, possibly without 
knowing it, to do what in introductory quantum mechanics is called addi- 
tion of angular momentum. This occurs when we form a tensor product 
of two sets of states which transform under the group.* This happens, in turn, 
whenever a system responds to the group transformation in more than one 
way. The classic example of this is a particle that carries both spin and orbital 
angular momentum. In this case, the system can be described in a space that 
you can think of as built of a product of two different kinds of kets. 


|, m) = p) |z) (3.37) 


where the first states, |2) transforms under representation D, of the group 
and the second, |z}, under D2. Then the product, called the tensor product, 


^We saw an example of this in the normal modes of the triangle in our discussion of finite 
groups. 
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transforms as follows: 


D(g) ji, z) = |j, y) (Dig2 (9)]jyia 
= [3 ly) [D1 (g)];i [Do (9)]uz (3.38) 
= (7) [D1 (9)];) (ly) [D2(9)]uz) 


In other words, the two kets are just transforming independently under their 
own representations. If we look at this near the identity, for infinitesimal o, 
(1 + 2o J4) |i, z) 
= |j y) y| (1 + i&aJa) i; 2) 
= joy) (8s + io [297 (9)]jyiz) 
= jy) (55 + iaa[di];) (ôyz t io [J2]ys) 
Thus identifying first powers of ag 
[29^ (g)]jyis = [Ja liiduz + 95il7aluz (3.40) 


When we multiply the representations, the generators add, in the sense shown 
in (3.40). This is what happens with addition of angular momenta. We will 
often write (3.40) simply as 


(3.39) 


JIS8 JL ga (3.41) 


leaving you to figure out from the context where the indices go, and ignoring 
the ó-functions which, after all, are just identity operators on the appropriate 
space. In fact, you can think of this in terms of the action of the generators as 


follows: 
A (112) = (Jali) 02 + 9) (Jul) (3.42) 


3.5 J3 values add 


This is particularly simple for the generator J3 because we work in a basis in 
which J3 is diagonal. Thus the J3 values of tensor product states are just 
the sums of the J3 values of the factors: 


J3 (lj mı)lj2,m2)) = (mı + mz) (ljn ma )ja, ma)) (3.43) 


This is what we would expect, classically, for addition of angular momentum, 
of course. But in quantum mechanics, we can only make it work for one 
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component. We can, however, use this in the highest weight construction, 
(3.33). 

Consider, for example, the tensor product of a spin 1/2 and spin 1 rep- 
resentation. The highest weight procedure (3.33) is what you would use to 
decompose the product space into irreducible representations. Let's do it ex- 
plicitly. There is a unique highest weight state, 


[3/2, 3/2) = |1/2,1/2)]1, 1) (3.44) 


We can now construct the rest of the spin 3/2 states by applying lowering 
operators to both sides. For example using (3.42) 


J-j3/2,3/2) = J (D/2,1/2)]1, 1)) 


(3.45) 
= 15/19 = tue mt. + 11/2,1/2)11,0) 


or 
|3/2, 1/2) = "Hn —1/2)|1,1) + "ED 1/2)]1,0) (3.46) 
Continuing the process gives 
i5/2,-1/2 = 21/2, 121,0 + y 11/2, 1/3, -n 
[3/2, 3/2) = |1/2, —1/2)]1, —1) 


(3.47) 


Then the remaining states are orthogonal to these — 


of 310/2,-1/2)11,3) - «fBiny2,1/2)11,9) (3.48) 
and 
V Ina -/mn.o — Vana, -n (3.49) 


applying the highest weight scheme to this reduced space gives 


1/212 = V2n/2, 120.0 - af $10/2,1/2))1,0) 
2,12 = 3/2, -1/2)11,0) - 21/2,1/20. -) 


In this case, we have used up all the states, so the process terminates. Note 
that the signs of the spin 1/2 states were not determined when we found the 
states orthogonal to the spin 3/2 states, but that the relative sign is fixed be- 
cause the J3 = +1/2 states are related by the raising and lowering operators. 


(3.50) 
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Problems 
3.A. Use the highest weight decomposition, (3.33), to show that 
s+j 
U)e(s- X {9 
@l=|s—J| 


where the © in the summation just means that the sum is a direct sum, and 
{k} denotes the spin k representation of SU (2). To do this problem, you do 
not need to construct the precise linear combinations of states that appear in 
each irreducible representation, but you must at least show how the counting 
of states goes at each stage of the highest weight decomposition. 


3.B. Calculate 


er? 
where d are the Pauli matrices. Hint: write 7 = |F] ?. 


3.C. Show explicitly that the spin 1 representation obtained by the high- 
est weight procedure with 7 = 1 is equivalent to the adjoint representation, 
with fabe = €abe by finding the similarity transformation that implements the 
equivalence. 


3.D. Suppose that (c,];; and (n;];, are Pauli matrices in two different 
two dimensional spaces. In the four dimensional tensor product space, define 
the basis 


ID-li-Dl-1 B)-li-10z-2) 
3)=|6=2)le=1) 14) =li = 2e = 2) 
Write out the matrix elements of o2 ® m in this basis. 


3.E. We will often abbreviate the tensor product notation by leaving out 
the indices and the identity matrices. This makes for a very compact notation, 
but you must keep your wits about you to stay in the right space. In the 
example of problem 3.D, we could write: 


[ca]ijIno]zy aS Oat) 
[oslijózy as Sa 


- ÓOu[m]z, as m 
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ójó;, as 1 


So for example, (c1)(o271) = i037; and (0172)(013) = im. 

To get some practice with this notation, calculate 

(a) (Pa, Tone] , 

(b) Tr (Ta Um; acna}) , 

(c) [oin 0272] . 
where o, and na are independent sets of Pauli matrices and {A,B} = AB + 
BA is the “anticommutator.” 


Chapter 4 


Tensor Operators 


A tensor operator is a set of operators that transforms under commutation 
with the generators of some Lie algebra like an irreducible representation of 
the algebra. In this chapter, we will define and discuss tensor operators for 
the SU (2) algebra discussed in chapter 3. A tensor operator transforming 
under the spin-s representation of SU (2) consists of a set of operators, O; 
for £ = 1 to 2s+1 (or —s to s), such that 


[Jas O?] = Om [Ja]me- (4.1) 


It is true, though we have not proved it, that every irreducible representa- 
tion is finite dimensional and equivalent to one of the representations that we 
found with the highest weight construction. We can always choose all tensor 
operators for SU (2) to have this form. 


4.1 Orbital angular momentum 


Here is an example — a particle in a spherically symmetric potential. If the 
particle has no spin, then Ja is the orbital angular momentum operator, 


Ja = La = €abc Tb Dc (4.2) 


The position vector is related to a tensor operator because it transforms under 
the adjoint representation 


E Ta] = €acd [re Dd; ro] = —t€acd lc Oba (4.3) 


= —llach Te = Te [J2] o 
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where Jè is the adjoint representation, and we know from problem 3.C that 
this representation is equivalent to the standard spin 1 representation from the 
highest weight procedure. 


4.2 Using tensor operators 


Note that the transformation of the position operator in (4.3) does not have 
quite the right form, because the representation matrices J24) are not the stan- 
dard form. The first step in using tensor operators is to choose the operator 
basis so that the conventional spin s representation appears in the commuta- 
tion relation (4.1). This is not absolutely necessary, but it makes things easier, 
as we will see. We will discuss this process in general, and then see how it 
works for ra. 

Suppose that we are given a set of operators, Q, for z = 1 to 2s41 
that transforms according a representation D that is equivalent to the spin-s 
representation of SU (2): 


[Ja; A = Qy LAT (4.4) 


Since by assumption, D is equivalent to the spin-s representation, we can find 
a matrix S such that 
SJPg- Js (4.5) 


or in terms of matrix elements 
[S]e [3205 [S ye n PAP (4.6) 


Then we define a new set of operators 


Oj = y [S]e for l = —s tos (4.7) 
Now Oj satisfies 

[Ja Ot] 

= [Ja, Qy] [S "lye 

= Q7 ley [S "lye (48) 


= 2,[S "ee [S]e 2 [27129 [S ye 
= Op Jale« 
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which is what we want. Notice that (4.8) is particularly simple for J3, because 
in our standard basis in which the indices £ label the Jz value, J+, (or J$ for 
any s) is a diagonal matrix 


[J3]p; =le forl, l = —s tos. (4.9) 


Thus 
[Js, Of] = Of [J$]e« = £01. (4.10) 
In practice, it is usually not necessary to find the matrix S explicitly. If 
we can find any linear combination of the (2, which has a definite value of 
J3 (that means that it is proportional to its commutator with J3), we can take 
that to be a component of O°, and then build up all the other O° components 
by applying raising and lowering operators. 
For the position operator it is easiest to start by finding the operator ro. 
Since [J3, r3] = 0, we know that r3 has J3 = 0 and therefore that r3 œ ro. 


Thus we can take 
TQ — T3 (4.1 1) 
Then the commutation relations for the spin 1 raising and lowering operators 
give the rest 
J*,rg]- 
Urol na | (4.12) 
= (rı +ir2)/ v2 


4.3 The Wigner-Eckart theorem 


The interesting thing about tensor operators is how the product Oj |j, m, a) 
transforms. 


Ja Of |j, m, a) 
= (Ja, 0j] |j, m, a) + OF Ja |j, m, o) (4.13) 
= On lj, m, a) PATT, T O; |j, m/ , a) [42 Lain 
This is the transformation law for a tensor product of spin s and spin j, s & J. 
Because we are using the standard basis for the states and operators in which 
J3 is diagonal, this is particularly simple for the generator J3, for which (4.13) 


becomes 
Jz O; |j, m, a) = (L + m) Of |j, m, o) (4.14) 


The J3 value of the product of a tensor operator with a state is just the 
sum of the Jz} values of the operator and the state. 
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The remarkable thing about this is that the product of the tensor operator 
and the ket behaves under the algebra just like the tensor product of two 
kets. Thus we can decompose it into irreducible representations in exactly 
the same way, using the highest weight procedure. That is, we note that 
O: |j, j, œ) with J3 = j + s is the highest weight state. We can lower it 
to construct the rest of the spin j + s representation. Then we can find the 
linear combination of J3 = 7 + s — 1 states that is the highest weight of the 
spin 7 + s — 1 representation, and lower it to get the entire representation, 
and so on. In this way, we find explicit representations for the states of the 
irreducible components of the tensor product in terms of linear combinations 
of the O; |j, m, o). You probably know, and have shown explicitly in problem 
3.A, that in this decomposition, each representation from j + s to |j — s| 
appears exactly once. We can write the result of the highest weight analysis 
as follows: 


Y. Oil M — £a) (5,3, M — £ | J, M) = ky |J, M) (4.15) 
£ 


Here |J, M) is a normalized state that transforms like the J3 = M compo- 
nent of the spin J representation and ky is an unknown constant for each J 
(but does not depend on M). The coefficients (s, j, £, M — £ | J, M) are 
determined by the highest weight construction, and can be evaluated from 
the tensor product of kets, where all the normalizations are known and the 
constants ky are equal to 1: 


»»IDD |j, M — £) (5,3, M — £| J,M) F |J, M) (4.16) 
£ 


One way to prove! that the coefficients can be taken to be the same in (4.15) 
and (4.16) is to notice that in both cases, J+ |J, J) must vanish and that 
this condition determines the coefficients (s, j, £, J — £ | J, J} up to a mul- 
tiplicative constant. Since the transformation properties of O% |j, m} and 
|s, £) |j, m) are identical, the coefficients must be proportional. The only 
difference is the factor of ky in (4.15). 

We can invert (4.15) and express the original product states as linear com- 
binations of the states with definite total spin J. 


js 
Ojljm,o)-2 M (A4*m|sj,£ m)k;|J,£4 m) (4.17) 
J=\j-s| 


'This is probably obvious, but as we will emphasize below, the operators are different 
because we do not have a scalar product for them. 


72 CHAPTER 4. TENSOR OPERATORS 


The coefficients (J, M | s,7,2,M — £) are thus entirely determined by 
the algebra, up to some choices of the phases of the states. Once we have a 
convention for fixing these phases, we can make tables of these coefficients 
once and for all, and be done with it. The notation (J, + m|s, j, £, m) just 
means the coefficient of |J, £ + m) in the product |s, Z) |j, m). These are 
called Clebsch-Gordan coefficients. 

The Clebsch-Gordan coefficients are all group theory. The physics comes 
in when we reexpress the |J, £ + m) in terms of the Hilbert space basis states 
|J, L+ m, p) CE 


kj |J,£ +m) 2 M. kag |J, £ +m, B) (4.18) 
B 


We have absorbed the unknown coefficients ky into the equally unknown co- 
efficients kag. These depend on a, j, O? and s, because the original products 
do, and on 8 and J, of course. But they do not depend at all on £ or m. We 
only need to know the coefficients for one value of l+ m. The kag are called 
reduced matrix elements and denoted 


kag = (J, p| O° |j, o) (4.19) 


Putting all this together, we get the Wigner-Eckart theorem for matrix ele- 
ments of tensor operators: 


(J, m^, B| Of |j, m, o) 
= Om! £-m (LE + ms, j, £,m) ` (J, 8| O° |j, o) 


(4.20) 


If we know any non-zero matrix element of a tensor operator between states 
of some given J, B and j, œ, we can compute all the others using the algebra. 
This sounds pretty amazing, but all that is really going on is that we can use 
the raising and lowering operators to go up and down within representations 
using pure group theory. Thus by clever use of the raising and lowering oper- 
ators, we can compute any matrix element from another. The Wigner-Eckart 
theorem just expresses this formally. 


4.4 Example 


Suppose 
Find 
(1/2, 1/2, 0| 1 1/2, —1/2, 8) =? (4.22) 
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First, since ro = rs, 
(1/2, 1/2, a| ro |1/2, 1/2, 8) = A (4.23) 


Then we know from (4.12) that 


1 
T] = pO + r1) (4.24) 


Thus 
(1/2,1/2, a| r1 |1/2, —1/2, B) 


= (/2,1/2,o| Cre +r1)[1/2,—1/28) —— (425) 


=~ (1/2,1/2,a 744 |1/2,—1/2, 6) 

Now we could plug this into the formula, and you could find the Clebsch- 
Gordan coefficients in a table. But I'll be honest with you. I can never re- 
member what the definitions in the formula are long enough to use it. In- 
stead, I try to understand what the formula means, and I suggest that you do 
the same. We could also just use what we have already done, decomposing 
1/2 & 1 into irreducible representations. For example, we know from the 
highest weight construction that 


3/2, 3/2) = r1 |1/2, 1/2, B) (4.26) 


is a 3/2,3/2 state because it is the highest weight state that we can get as a 
product of an rg operator acting on an |1/2, m} state. Then we can get the 
corresponding |3/2, 1/2) state in the same representation by acting with the 
lowering operator J^ 


|3/2,1/2) = "Ha |3/2, 3/2) 
= Er |1/2, 1/2, 8) + Jira }1/2, —1/2, B) 


But we know that this spin-3/2 state has zero matrix element with any spin- 
1/2 state, and thus 


(4.27) 


0 = (1/2, 1/2, a| 3/2, 1/2) 
= [2(0/2,1/2, al 0 |1/2,1/2,8) (4.28) 
tan 1/2, a| r.1]1/2, —1/2, B) 
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SO 
(1/2,1/2, a r41 |1/2, 1/2, B) 
= —V2(1/2, 1/2, a| ro 1/2, 1/2, 8) (4.29) 
=-V2A 
SO 
(1/2, 1/2, o| r1 |1/2, —1/2, 8} = A (4.30) 


Although we did not need it here, we can also conclude that 


11/2,1/2) = Jin 1/2, 1/2, a) — Vina |1/2,-1/2,a) | (431) 


is a 1/2,1/2 state. This statement is actually a little subtle, and shows the 
power of the algebra. When we did this analysis for the tensor product of 7-1 
and 7=1/2 states, we used the fact that the |1/2, 1/2) must be orthogonal to 
the |3/2, 1/2) states to find the form of the |1/2, 1/2) state. We cannot do this 
here, because we do not know from the symmetry alone how to determine the 


norms of the states 
re |1/2, m) (4.32) 


However, we know from the analysis with the states and the fact that the 
transformation of these objects is analogous that 


J 11/2 1/2) =0 (4.33) 


Thus it is a 1/2,1/2 state because it is the highest weight state in the represen- 
tation. We will return to this issue later. 

There are several ways of approaching such questions. Here is another 
way. Consider the matrix elements 


(1/2, m, a| ra |1/2, m^, B) (4.34) 


The Wigner-Eckart theorem implies that these matrix elements are all propor- 
tional to a single parameter, the kag. Furthermore, this result is a consequence 
of the algebra alone. Any operator that has the same commutation relations 
with J, will have matrix elements proportional to rą. But Ja itself has the 
same commutation relations. Thus the matrix elements of ra are proportional 
to those of Ja. This is only helpful if the matrix elements of J, are not zero 
(if they are all zero, the Wigner-Eckart theorem is trivially satisfied). In this 
case, they are not (at least if a = 8) 


(1/2, m, | Jo |1/2,m', B) = Jn slo lm (4.35) 
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Thus 
(1/2, m, œ| Ta |1/2, m, 8) « [oa] mm (4.36) 


This gives the same result. 


4.5 * Making tensor operators 


If often happens that you come upon a set of operators which transforms 
under commutation with the generators like a reducible representation of the 


algebra 
[Ja Q] = LE (4.37) 


where D is reducible. In this case, some work is required to turn these into 
tensor operators, but the work is essentially just the familiar highest weight 
construction again. The first step is to make linear combinations of the (1; 
operators that have definite J3 values 


[J3, Om,a] = M One (4.38) 


This is always possible because D can be decomposed into irreducible rep- 
resentations that have this property. Then we can apply the highest weight 
procedure and conclude that the operators, with the highest weight, O; aœ are 
components of a tensor operator with spin 7, one for each o. If there are any 
operators with weight 7—1/2, O;_1/2,, they will be components of tensor 
operators with spin 7—1/2. However, things can get subtle at the next level. 
To find the tensor operators with spin 7—1, you must find linear combinations 
of the operators with weight 7—1 which have vanishing commutator with J+ 
— then they correspond to the highest weights of the spin ;— 1 reps 


[42070] (4.39) 


The point is, if you get the operators in a random basis, you have nothing like 
a scalar product, so you cannot simply find the operators that are “orthogo- 
nal" to the ones you have already assigned to representations. I hope that an 
example will make this clearer. Consider seven operators, a1, b. and ap, 
bo and co, with the following commutation relations with the generators: 


[J3, a4] — 041 [J3, b41] = bii 
[J3, ao] = [J3, bo] = [Js, co] =0 (4.40) 
[3 a1] = -a1 = [Js,b4] = —b-1 
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Il 
o 


[J*,a41] = [75,54] 
[J*, ao] = a1 [J*, bo] = b41 


[J*,a-1] = co [J*,b-1] = 


_ 1 e 1 
[J 441] = 3 (0 + bo + Co) [J NEN = 5 (20 + bo — co) 
[J^ , ao] = 28.145. [J^ bo] =a +b [J^ co] =A.) 


eq) = bal eb 
(4.42) 
To construct the tensor operators, we start with the highest weight states, 
and define 


Ay = 41 By = 641 (4.43) 


Then we construct the rest of the components by applying the lowering oper- 
ators 


1 1 
Ao = 5(a0 + bo +o) Bo = 5(a0 + bo — co) C8) 


and 
A- =2a_;+06_) Bia ab. (4.45) 


You can check that the raising operators now just move us back up within the 
representations. 

Now there is one operator left, so it must be a spin O representation. 
But which one is it? It must be the linear combination that has vanishing 
commutator with J+ — therefore it is 


Co = a9 — bo — Co (4.46) 


Let me emphasize again that we went through this analysis explicitly 
to show the differences between dealing with states and dealing with tensor 
operators. Had this been a set of seven states transforming similarly under 
the algebra, we could have constructed the singlet state by simply finding 
the linear combination of J3 = 0 states orthogonal to the J4 = 0 states in 
the triplets. Here we do not have this crutch, but we can still find the singlet 
operator directly from the commutation relations. We could do the same thing 
for states, of course, but it is usually easier for states to use the nice properties 
of the scalar product. 
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4.6 Products of operators 


One of the reasons that tensor operators are important is that a product of two 


tensor operators, Of}, and O72, in the spin sı and spin s2 representations, 


transforms under the tensor product representation, s; & s2 because 
[ta Oe Od 
= [Ja, 03] O FO. [Ja OR] (4.47) 
== Oi On [Xa ]m m F Qn Omi, [Xa lmm 


Thus the product can be decomposed into tensor operators using the highest 
weight procedure. 

Note that as usual, things are particularly simple for the generator J3. 
(4.47) implies 


[J3 O21, O32,] = (mı + mz) OF, O8, (4.48) 


The J3 value of the product of two tensor operators is just the sum of the J3 
values of the two operators in the product. 


Problems 


4.A. Consider an operator Oz, for x = 1 to 2, transforming according to 
the spin 1/2 representation as follows: 


[Ja Oz] = Oy [Ca]ys/2 
where o, are the Pauli matrices. Given 
(3/2, —1/2, a| O1 |1, 1,8) =A 
find 
(3/2, —3/2, a| O2 |1, —1, B) 


4.B. The operator (r41)? satisfies 
[L*, (r?] =0 


It is therefore the O;2 component of a spin 2 tensor operator. Construct the 
other components, Om. Note that the product of tensor operators transforms 
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like the tensor product of their representations. What is the connection of 
this with the spherical harmonics, Y;,,(0, 9)? Hint: let rı = sin6cos Q, 
r9 = sinÜsin à, and rg = cos. Can you generalize this construction to 
arbitrary £ and explain what is going on? 


4.C. Find 
giao X. d 


where the X] are given by (3.31) Hint: There is a trick that makes this one 
easy. Write 
o, Xl = oà,Xl 


where 

a= SMe; Age = 1 
You know that &, X1 has eigenvalues +1 and 0, just like X} (because all 
directions are equivalent). Thus (ó, X, 1)? is a projection operator and 


(à, X1) = (&a X1) 


You should be able to use this to manipulate the expansion of the exponential 
and get an explicit expression for gita X3. 


Chapter 5 
Isospin 


The idea of isospin arose in nuclear physics in the early thirties. Heisenberg 
introduced a notation in which the proton and neutron were treated as two 
components of a nucleon doublet 


N= i (5.1) 


He did this originally because he was trying to think about the forces between 
nucleons in nuclei, and it was mathematically convenient to write things in 
this notation. In fact, his first ideas about this were totally wrong — he re- 
ally didn't have the right idea about the relation between the proton and the 
neutron. He was thinking of the neutron as a sort of tightly bound state of 
proton and electron, and imagined that forces between nucleons could arise 
by exchange of electrons. In this way you could get a force between proton 
and neutron by letting the electron shuttle back and forth — in analogy with 
an Hj ion, and a force between neutron and neutron — an analogy with a 
neutral H5 molecule. But no force between proton and proton. 


5.1 Charge independence 


It was soon realized that the model was crazy, and the force had to be charge 
independent — the same between pp, pn and nn to account for the pat- 
tern of nuclei that were observed. But while his model was crazy, he had 
put the p and n together in a doublet, and he had used the Pauli matrices to 
describe their interactions. Various people soon realized that charge indepen- 
dence would be automatic if there were really a conserved "spin" that acted 
on the doublet of p and n just as ordinary spin acts on the two J4 compo- 
nents of a spin-1/2 representation. Some people called this “isobaric spin", 
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which made sense, because isobars are nuclei with the same total number of 
baryons,! protons plus neutrons, and thus the transformations could move 
from one isobar to another. Unfortunately, Wigner called it isotopic spin and 
that name stuck. This name makes no sense at all because the isotopes have 
the same number of protons and different numbers of neutrons, so eventually, 
the "topic" got dropped, and it is now called isospin. 


5.2 Creation operators 


Isospin really gets interesting in particle physics, where particles are routinely 
created and destroyed. The natural language for describing this dynamics is 
based on creation and annihilation operators (and this language is very useful 
for nuclear physics, as we will see). For example, for the nucleon doublet in 
(5.1), we can write 


|p, o) = af, , 10) 
n.o) = ah, 4 10) (5.2) 
where the i 
Oy Elo (5.3) 


are creation operators for proton (+5) and neutron -5 respectively in the 
state a, and |0) is the vacuum state — the state with no particles in it. The N 
stands for nucleon, and it is important to give it a name because we will soon 
discuss creation operators for other particles as well. The creation operators 
are not hermitian. Their adjoints are annihilation operators, 

QN lo (5.4) 
These operators annihilate a proton (or a neutron) if they can find one, and 
otherwise annihilate the state, so they satisfy 


aN +} al0) = 0 (5.5) 


The whole notation assumes that the symmetry that rotates proton into neu- 
tron is at least approximately correct. If the proton and the neutron were not 
in some sense similar, it wouldn’t make any sense to talk about them being in 
the same state. 

'Baryons are particles like protons and neutrons. More generally, the baryon number is 


one third the number of quarks. Because, as we will discuss in more detail later, the proton 
and the neutron are each made of three quarks, each has baryon number 1. 
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Because the p and n are fermions, their creation and annihilation opera- 
tors satisfy anticommutation relations: 


fis a mya} = mm Sap (5.6) 
faka aN, m'g} = {@n,ma, ON m! 5] =0 


With creation and annihilation operators, we can make multiparticle states 
by simply applying more than one creation operator to the vacuum state. For 


example 
n proton creation operators 


MMM— 
t MN 
LED (5.7) 
« [n protons; a,--+,@n) 


produces an n proton state, with the protons in states, o through an. The 
anticommutation relation implies that the state is completely antisymmetric 
in the labels of the particles. This guarantees that the state vanishes if any 
two of the os are the same. It means (among other things) that the Pauli 
exclusion principle is automatically satisfied. What is nice about the creation 
and annihilation operators is that we can construct states with both protons 
and neutrons in the same way. For example, 


n nucleon creation operators 
TN, 
t ASi 
€ Nm, o1 ON m, orn |0) (5.8) 
x |n nucleon; mi,01;:::;ma4,04) 


is an n nucleon state, with the nucleons in states described by the m variable 
(which tells you whether it is a proton or a neutron) and the a label, which 
tells you what state the nucleon is in. Now the anticommutation relation 
implies that the state is completely antisymmetric under exchange of the pairs 
of labels, m and a. 


[n nucleon; m1, a1; M2, Q2°++; Mn, A) (5.9) 
= —|n nucleon; m5, 095; m1, 01; +++; ma, Qn) 


If you haven’t seen this before, it should bother you. It is one thing to assume 
that the proton creation operators anticommute, because two protons really 
cannot be in the same state. But why should proton and neutron creation 
operators anticommute? This principle is called the “generalized exclusion 
principle.” Why should it be true? This is an important question, and we will 
come back to it below. For now, however, we will just see how the creation 
and annihilation operators behave in some examples. 
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5.3 Number operators 


We can make operators that count the number of protons and neutrons by 
putting creation and annihilation operators together (the summation conven- 
tion is assumed): 


t 
ON 4 lo AN Ll counts protons 


ON 1a Nera counts neutrons (5.10) 


1 
QN. m,a ON,m,a counts nucleons 


Acting on any state with N, protons and N» neutrons, these operators have 
eigenvalues Np, Nn and Np + Nn respectively. This works because of (5.5) 
and the fact that for a generic pair of creation and annihilation operators 


lata, at] — al (5.11) 


Notice that the number operators in (5.10) are summed over all the possible 
quantum states of the proton and neutron, labeled by o. If we did not sum 
over a, the operators would just count the number of protons or neutrons or 
both in the state a. We could get fancy and devise more restricted number 
operators where we sum over some a and not others, but we won't talk further 
about such things. The total number operators, summed over all o, will be 
particularly useful. 


5.4 Isospin generators 


For the one-particle states, we know how the generators of isospin symmetry 
should act, in analogy with the spin generators: 


1 
Talm, a) = |m, a) [Ja lmm = zm’, a) [omm 613 
Or in terms of creation operators 
t t 1/2 1 t 
Ta C Nm. |0) = ON m'a 0) [Ja "1mm = ZIN, m,a lO [ca]mm (5.13) 


Furthermore, the state with no particles should transform like the trivial rep- 
resentation — 
Ta |0) = 0 (5.14) 
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Thus we will get the right transformation properties for the one particle states 
if the creation operators transform like a tensor operator in the spin 1/2 rep- 
resentation under isospin: 

1 


[To akma] = ahm Ja! ntm = Nm a Ta] (5.15) 


It is easy to check that the following form for Ta does the trick: 


Ta = a m,a LP n aN,m,a ds 
o Pa] m/m aN ma Ttc (5.16) 


= Eg Og QN. cc 

where --- commutes with the nucleon creations and annihilation operators 
(and also annihilates |0)). The last line is written in matrix form, where we 
think of the annihilation operators as column vectors and the creation op- 
erators as row vectors. Let us check that (5.16) has the nght commutation 
relations with the creation operators so that (5.15) is satisfied. 


H5. M 


= [a e.a UP et QN,m' fl di 
= Ghote [JI or [ange ana] (5.17) 
= lass üb s (Ii mwm” QN m” 6B 


= ai E al oan 


The advantage of thinking about the generators in this way is that we now 
immediately see how multiparticle states transform. Since the multiparticle 
states are built by applying more tensor (creation) operators to the vacuum 
state, the multiparticle states transform like tensor products — not a surpris- 
ing result, but not entirely trivial either. 


5.5 Symmetry of tensor products 


We pause here to discuss an important fact about the combination of spin 
states (either ordinary spin or isospin). We will use it in the next section to 
discuss the deuteron. The result is this: when the tensor product of two iden- 
tical spin 1/2 representations is decomposed into irreducible representations, 


84 CHAPTER 5. ISOSPIN 


the spin 1 representation appears symmetrically, while the spin 0 appears an- 
tisymmetrically. To see what this means, suppose that the spin 1/2 states are 


(1/2, 21/2, o) (5.18) 


where a indicates whatever other parameters are required to describe the 
state. Now consider the highest weight state in the tensor product. This is 
the spin 1 combination of two identical J3=1/2 states, and is thus symmetric 
in the exchange of the other labels: 


|1,1) = |1/2,1/2,0)|1/2, 1/2, 8) = |1/2,1/2,8)|1/2,1/2,0) — (5.19) 


The lowering operators that produce the other states in the spin 1 representa- 
tion preserve this symmetry because they act in the same way on the two spin 
1/2 states. 1 
— | |1/2, 71/2, a)|1/2, 1/2, 
J (11/2 —1/2,0) 11/2, 1/2, 8 


+{1/2,1/2,a)|1/2, -1/2, 2y (5.20) 


(1, -1) = |1/2, —-1/2, a)|1/2, —1/2, B) 


Then the orthogonal spin O state is antisymmetric in the exchange of o and 


B: 


[1, 0) = 


5 (hus, —1/2, a)|1/2, 1/2, 8) 


-1/2,1/2,2)1/2, 1/2, 


0, 0) = 
(5.21) 


5.6 The deuteron 


The nucleons have spin 1/2 as well as isospin 1/2, so the a in the nucleon 
creation operator actually contains a J3 label, in addition to whatever other 
parameters are required to determine the state. 

As a simple example of the transformation of a multiparticle state, con- 
sider a state of two nucleons in an s-wave — a zero angular momentum state. 
Then the total angular momentum of the state is simply the spin angular mo- 
mentum, the sum of the two nucleon spins. Furthermore, in an s-wave state, 
the wave function is symmetrical in the exchange of the position variables 
of the two nucleons. Then because the two-particle wave function is pro- 
portional to the product of two anticommuting creation operators acting on 
the vacuum state, it is antisymmetric under the simultaneous exchange of the 
isospin and spin labels of the two nucleons — if the spin representation is 
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symmetric, the isospin representation must be antisymmetric, and vice versa. 
When combined with the results of the previous section, this has physical 
consequences. The only allowed states are those with isospin 1 and spin O or 
with isospin O and spin 1. The deuteron is an isospin 0 conbination, and has 
spin 1, as expected. 


5.7 Superselection rules 


It appears, in this argument, that we have assigned some fundamental physical 
significance to the anticommutation of the creation operators for protons and 
neutrons. As I mentioned above, this seems suspect, because in fact, the 
proton and neutron are not identical particles. What we actually know directly 
from the Pauli exclusion principle is that the creation operator, aj, for any 
state of a particle obeying Fermi-Dirac statistics satisfies 


C =0 (5.22) 


If we have another creation operator for the same particle in another state, 
a}, we can form the combination al, + a}, which when acting on the vacuum 
creates the particle in the state œ + 8 (with the wrong normalization). Thus 


the exclusion principle also implies 


(ai rs ab) =0 (5.23) 


and thus 

fatah} =0 (5.24) 
This argument is formally correct, but it doesn't really make much physi- 
cal sense if a], and al create states of different particles, because it doesn't 
really make sense to superpose the states — this superposition is forbidden 
by a superselection rule. À superselection rule is a funny concept. It is the 
statement that you never need to think about superposing states with different 
values of an exactly conserved quantum number because those states must be 
orthogonal. Anything you can derive by such a superposition must also be 
derivable in some other way that does not involve the “forbidden” superpo- 
sition. Thus as you see, the superposition is not so much forbidden as it is 
irrelevant. In this case, it is possible to show that one can choose the creation 
operators to anticommute without running into inconsistencies, but there is a 
much stronger argument. The anticommutation is required by the fact that the 
creation operators transform like tensor operators. Let's see how this implies 
the stated resuit for the two nucleon system. 
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Call the creation operators for the baryons al, (dropping the N for 
brevity) where the first sign is the sign of the third component of isospin and 


the second is the sign of third component of spin. Since e JJ = 0, there 
is no two nucleon state with T5 = 1 and J3 = 1. But this means that there is 
no state with isospin 1 and spin 1, since the highest weight state would have 
to have T3 = 1 and Js = 1. In terms of creation operators, for example 


fr, (4) _ {at,,at,} =0 (5.25) 


Similar arguments show that the operators must anticommute whenever they 
have one common index and the others are different. 

The argument for operators that have no index in common is a little more 
subtle. First compute 


[|r (ata) |] = (tate) (cleat }=o — eae 


But the two terms in the sum must separately vanish because they are phys- 
ically distinguishable. There cannot be a relation like (5.26) unless the two 
operators 


{at_,al,} 0) (5.27) 


and 
{al ,,a4_} 10) (5.28) 


separately vanish, because these two operators, if they did not vanish, would 
do physically distinguishable things — the creation of a proton with spin up 
and a neutron with spin down is not the same as the creation of proton with 
spin down and a neutron with spin up. Thus the operators (5.27) and (5.28) 
must separately vanish. Thus, not only does the isospin 1, spin 1 state (5.26) 
vanish but so also does the isospin 0, spin O state 


fat af} 10) - {at,,a4_} 10) (5.29) 


5.8 Other particles 


When isospin was introduced, the only known particles that carried it were 
the proton and neutron, and the nuclei built out of them. But as particle 
physicists explored further, at higher energies, new particles appeared that 
are not built out of nucleons. The first of these were the pions, three spinless 
bosons (that is obeying Bose-Einstein, rather than Fermi-Dirac statistics) with 
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charges Q = +1, 0 and —1, and 73 = Q, forming an isospin triplet.? The 
creation and annihilation operators for the pions can be written as 


t 
an ma? 


armo form =-—ltol (5.30) 


They satisfy commutation, rather than anticommutation relations 


een cun] == mm! 5a6 


(5.31) 
[at asl ana = [ar m,a: ar m! 6] =0 


so that the particle states will be completely symmetric. They also commute 
with nucleon creation and annihilation operators. 
The isospin generators look like 


T, =a} pees: armat: (5.32) 


T, m,Q 
where as in (5.16) the --- refers to the contributions of other particles (like 
nucleons). Again, then the creation operators are tensor operators. 

There are many many other particles like the nucleons and the pions that 
participate in the strong interactions and carry isospin. The formalism of cre- 
ation and annihilation operators gives us a nice way of writing the generators 
of isospin that acts on all these particles. The complete form of the isospin 
generators is 


To = p» ala Rpg Ar m'a (5.33) 


particles x 
states a 
T3 values m,m’ 


where al, m a Nd Gz m'a are creation and annihilation operators for z-type 
particles satisfying commutation or anticommutation relations depending on 
whether they are bosons or fermions, 


asma: abd al 4 = mm obrr’ (5 34) 


t 2 = 
[al nateli m [ar m,a Ox! m',5] 4 =0 


The rule for the + (+ for anticommutator, — for commutator) is that the anti- 
commutator is used when both z and z' are fermions, otherwise the commu- 
tator is used. The 7, in (5.33) is the isospin of the x particles. 


?When these particles were discovered, it was not completely obvious that they were not 
built out of nucleons and their antiparticles. When very little was known about the strong 
interactions, it was possible to imagine, for example, that the 7* was a bound state of a proton 
and an antineutron. This has all the right quantum numbers — even the isospin is right. It just 
turns out that this model of the pion is wrong. Group theory can never tell you this kind of 
thing. You need real dynamical information about the strong interactions. 
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5.9 Approximate isospin symmetry 


Isospin is an approximate symmetry. What this means in general is that the 
Hamiltonian can be written as 


H=H)+AH (5.35) 


where Hp commutes with the symmetry generators and AH does not, but 
in some sense AH is small compared to Ho. It is traditional to say in the 
case of isospin that the "strong" interactions are isospin symmetric while 
the weak and electromagnetic interactions are not, and so take Hp = Hg 
and AH = Hgm + Hy where Hs, Hgm and Hy are the contributions 
to the Hamiltonian describing the strong interactions (including the kinetic 
energy), the electromagnetic interactions, and the weak interactions, respec- 
tively. From our modern perspective, this division is a bit misleading for two 
reasons. Firstly, the division between electromagnetic and weak interactions 
is not so obvious because of the partial unification of the two forces. Sec- 
ondly, part of the isospin violating interaction arises from the difference in 
mass between the u and d quarks which is actually part of the kinetic energy. 
It seems to be purely accidental that this effect is roughly the same size as the 
effect of the electromagnetic interactions. But this accident was important 
historically, because it made it easy to understand isospin as an approximate 
symmetry. There are so many such accidents in particle physics that it makes 
one wonder whether there is something more going on. At any rate, we will 
simply lump all isospin violation into AH. The group theory doesn't care 
about the dynamics anyway, as long as the symmetry structure is properly 
taken into account. 


5.10 Perturbation theory 


The way (5.35) is used is in perturbation theory. The states are classified 
into eigenstates of the zeroth order, isospin symmetric part of the Hamilto- 
nian, Hg. Sometimes, just Hp is good enough to approximate the physics 
of interest. If not, one must treat the effects of AH as perturbations. In the 
scattering of strongly interacting particles, for example, the weak and electro- 
magnetic interactions can often be ignored. Thus in pion-nucleon scattering, 
all the different possible charge states have either isospin 1/2 or 3/2 (because 
1@1/2 = 3/2 & 1/2), so this scattering process can be described approxi- 
mately by only two amplitudes. 

The mathematics here is exactly the same as that which appears in the 
decomposition of a spin-1/2 state with an orbital angular momentum 1 into 
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states with total angular momentum 3/2 and 1/2. The state with one pion and 
one nucleon can be described as a tensor product of an isospin 1/2 nucleon 
state with an isospin 1 pion state, just as the state with both spin and orbital 
angular momentum can be described as a tensor product, having both spin 
and angular momentum indices. 


Problems 


S.A. Suppose that in some process, a pair of pions is produced in a state 
with zero relative orbital angular momentum. What total isospin values are 
possible for this state? 


5.B. Show that the operators defined in (5.33) have the commutation 
relations of isospin generators. 


5.C. A**, At, A? and A- are isospin 3/2 particles (Ts = 3/2, 1/2, 
—1/2 and —3/2 respectively) with baryon number 1. They are produced 
by strong interactions in 7-nucleon collisions. Compare the probability of 
producing A*^* in t+ P — A++ with the probability of producing A? in 
rTP — AP. 


Chapter 6 


Roots and Weights 


Now we are going to generalize the analysis of the representations of the 
SU (2) algebra to an arbitrary simple Lie algebra. The idea is simple. First, 
we do what we always try to do in quantum mechanics — find the largest 
possible set of commuting hermitian observables and use their eigenvalues to 
label the states. In this case, our observables will be the largest set of hermi- 
tian generators we can find that commute with one another, and can therefore 
be simultaneously diagonalized. Their eigenvalues will be the analog of J3. 
The rest of the generators will be analogous to the raising and lowering op- 
erators in SU (2). We will find that every raising operator corresponds to an 
SU (2) subgroup of the Lie algebra, and then we can use what we know about 
SU (2) to learn about the larger algebra. 


6.1 Weights 


We want the largest possible set of commuting hermitian generators because 
we want to diagonalize as much as possible. A subset of commuting hermi- 
tian generators which is as large as possible is called a Cartan subalgebra. 
It will turn out that the Cartan subalgebra is essentially unique, in that any 
one we choose will give the same results. 

In a particular irreducible representation, D, there will be a number of 
hermitian generators, H; for i = 1 to m, corresponding to the elements of the 
Cartan Subalgebra called the Cartan generators satisfying 


H,=H!, and  [HjHj]-0. (6.1) 


The Cartan generators form a linear space. Thus we can choose a basis in 
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which they satisfy 
Tr (H;H;) = kpói; for1,7 =ltom (6.2) 


where k is some constant that depends on the representation and on the nor- 
malization of the generators. The integer m, the number of independent Car- 
tan generators, is called the rank of the algebra. 

Of course, the point is that the Cartan generators can be simultaneously 
diagonalized. After diagonalization of the Cartan generators, the states of the 
representation D can be written as |, £, D) where 

Aj|u, £, D) = pilu, £, D) (6.3) 
and z is any other label that is necessary to specify the state. 

The eigenvalues p; are called weights. They are real, because they are 
eigenvalues of hermitian operators. The m-component vector with compo- 
nents u; is the weight vector. We will often use a vector notation in which 


Q&Q: u = Qipi and o? = aijai (6.4) 


6.2 More on the adjoint representation 


The adjoint representation, defined by (2.27), is particularly important. Be- 
cause the rows and columns of the matrices defined by (2.27) are labeled by 
the same index that labels the generators, the states of the adjoint representa- 
tion correspond to the generators themselves. We will denote the state in the 
adjoint representation corresponding to an arbitrary generator Xa as 


IXa) - (6.5) 


Linear combinations of these states correspond to linear combinations of the 
generators — 


o|Xa) + B|Xs) = |aXa + Xo). (6.6) 
A convenient scalar product on this space is the following:! 
(X) = A7! Tr (LX) , (6.7) 


(A is what we called kp for the adjoint representation — see (2.37)). Now 
using (6.6) and (2.27), we can compute the action of a generator on a state, 
as follows: 
Xal Xi) = |X) (X¢|Xa|Xo) = 1X9 [Taleo = ~i facs|Xe) 
E i fabel Xc) = li foscXc) = \[Xe; Xy) . 


'We need the dagger because we will be led to consider complex linear combinations of 
the generators, analogous to the raising and lowering operators for SU (2). 


(6.8) 
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6.3 Roots 


The roots are the weights of the adjoint representation. Because [H;, H;] = 
0, the states corresponding to the Cartan generators have zero weight vectors 


Hi|H;) = \[Hi, Hj]) = 0 (6.9) 


Furthermore, all states in the adjoint representation with zero weight vectors 

correspond to Cartan generators. Because of (6.2), the Cartan states are or- 
thonormal, 

(Hi|Hj) = A^ Ty (HiH;) = óij (6.10) 

The other states of the adjoint representation, those not corresponding to 

the Cartan generators, have non-zero weight vectors, a, with components Q;, 


H;|Ea) = oi| Ea) (6.11) 
which means that the corresponding generators satisfy 
[Hi, Ea] = oi Ea (6.12) 


It will turn out (and we will prove it below) that for the adjoint representation, 
the non-zero weights uniquely specify the corresponding states, so there is no 
need for another parameter (like z in (6.3) in the arbitrary representation D). 
Like the SU(2) raising and lowering operators, the E, are not hermitian. 
They cannot be hermitian because we can take the adjoint of (6.12) and get 


[H:, Hi] = -o El (6.13) 

thus we can take 
El = Ea. (6.14) 
This should remind you of the raising and lowering operators J* and JT in 


SU(2). 

States corresponding to different weights must be orthogonal, because 
they have different eigenvalues of at least one of the Cartan generators. Thus 
we can choose the normalization of the states in the adjoint representation 
(that is, the generators) so that 


(E,|Eg) = A^ Tr (EEs) = bas (= [D 58). (6.15) 


The weights o; are called roots, and the special weight vector o with compo- 
nents a; is a root vector. 
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6.4 Raising and lowering 


— The £4, are raising and lowering operators for the weights, because the 
state Exo |n, D) has weight y + a — 


HjExslp, D) = [H;, Exo] |u, D) + Exo Hi|p, D) = (u + a)iE+alu, D). 
(6.16) 
At this point, we have no notion of positivity, so it doesn't make sense to ask 
which is raising and which is lowering. But we will introduce this later. 
Equation (6.16) is true for any representation, but it is particularly impor- 
tant for the adjoint representation. To see why, consider the state E4|E 4). 
This has weight a — a = 0, thus it is a linear combination of states corre- 
sponding to Cartan generators. This in turn implies that [E,, Eq] is a linear 
combination of Cartan generators: 


Eq|E-a) = 6: |Hi) = AH) = |8 - H) = (Es E-a). —— (617 
But we can actually compute  — 


Bi = (Hiļ|Ea|E-a) 
= à`! Tr (H; [Ea, E—a]) this follows from (6.8) 


) 
= A`} Tr (E-a [Hi, Ea]) from the cyclic property of Tr (6 18) 


= Aoi Tr (E-aEa) from (6.12) 
= Qi from (6.15). 
Thus 
(Eo, E-o] 2 a- H. (6.19) 


This should remind you of the SU (2) commutation relation [J*, J^] = Js. 
It is this analogy that we will exploit to learn about the compact Lie groups 
and their representations. 


6.5 Lots of SU (2)s 


For each non-zero pair of root vectors, ta, there is an SU (2) subalgebra of 
the group, with generators 


E* = lo[ Eta 


6. 
Ej = ja ?a- H. C2) 
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To see this, note that 


[E3, E*] = |o| 3 lo - H, Eza] (6.21) 
= |a|~3a- (ta)Exg = a| 1 E44 = +E* f 
and from (6.19) 
[E*, E^] = lol? (Eo, E-a] 


S E (6.22) 


We know on general grounds that the states of each irreducible represen- 
tation of the full algebra can be decomposed into irreducible representations 
of any one of these SU(2) subalgebras, and we already know everything 
about the irreducible representations of SU(2). This puts very strong con- 
straints on the nature of the roots. For example, we can now easily prove that 
the root vectors correspond to unique generators. Suppose the contrary, so 
that there are two generators, Eœ and E4. We can choose linear combina- 
tions of these two so that they are orthogonal in the adjoint representation (1 
will use the same names for the two generators, assuming that I chose them 
to be orthogonal from the beginning, just to avoid useless notation) — thus 
we can write 


(Eal En) = A^ Tr (ELEL) = A7! Tr (E-a E4) = 0. (6.23) 


Consider the behavior of the state | E7,) under the action of the SU (2) subal- 
gebra (6.20). E^ | E?) has zero weight vector, and thus it is a linear combina- 
tion of Cartan states. But 


(Hi| ET |E) = A Tr (H; [E7, E;]) 
= —\"! Tr (E` [Hi, E',]) (6.24) 
= =arx T! Tr (ELE) = 0 


for all i, and thus the coefficient of every Cartan state in E^ | E?) vanishes, 


and therefore 
E^ |E)) — 0. (6.25) 


But we also have 
E3|E;) = lo] ?o- H|E,) . = |Ea) (6.26) 


Equations (6.25) and (6.26) are inconsistent, because (6.25) implies that | E) 
is the lowest J3 state in an SU(2) representation, and (6.26) implies that it 
has J4 = 1. But the lowest J3 state of an SU (2) representation cannot have 
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positive J3 — the J3 value is always —j for a non-negative half integer 7. 
Thus E/, cannot exist, and we have shown, as promised, that | E) is uniquely 
specified by a — no other labels are required. 

In fact, if o is a root, then no non-zero multiple of o (except —o) is a 
root. To see this, note that the three states |E3) and |Æ} form a spin 1 repre- 
sentation of the algebra (6.20), because they form the adjoint representation. 
Now suppose ka a root for k # +1. Clearly, k must be a half-integer, be- 
cause the E; value of the corresponding state must be a half integer. But if k 
is an integer not equal to +1, the state is part of a representation that contains 
another state with root o, which is impossible, by the argument we just gave. 
And if k is half an odd integer, then there is a state with root a/2, and we can 
repeat the argument using the SU (2) associated with that generator and get a 
contradiction in the same way. 


6.6 Watch carefully - this is important! 

More generally, for any weight yu of a representation D, the E3 value is 
Es|u,z,D) = = les, D). (6.27) 

Because the £3 values must be integers or half integers, 


2o-u 


z~ is an integer. (6.28) 
a 


The general state |, z, D) can always be written as a linear combination 
of states transforming according to definite representations of the SU(2) de- 
fined by (6.20). Suppose that the highest spin state that appears in the linear 
combination is j. Then there is some non-negative integer p such that 


(E*)? |u,z, D) #0 (6.29) 


with weight +p o is the highest E; state of the SU (2) spin j representation, 
so that i 
(E+)?** |i, z, D) =0. (6.30) 


The £3 value of the state (6.29) is 


D + r ; 
aU PE LOB esr (6.31) 
Q Q 


Likewise, there is some non-negative integer q such that 


(E-)! |u, z, D) #0 (6.32) 
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with weight u — qa is the lowest E; state of the SU (2) spin j representation, 
so that 


(E-)** |u,z, D) =0 (6.33) 


and the £3 value of the state (6.32) is 


Qa: — qQ Qa: G 
2:89) L EE Lan j, (6.34) 


Adding (6.31) and (6.34) gives 


2a- u 


ai +p-—q=0 (6.35) 


or 
o: 1 
— = (nag). 6.36 

We will refer to (6.36) as the “master formula”. The relations (6.31), (6.34) 
and (6.36) are the basic relations that lead to a geometrical classification of 
all the compact Lie groups. They don’t look like much, but when we augment 
them with some geometrical intuition, we can exploit them to great effect, as 
you will see. 

Here is a simple first step. Applying (6.36) to the roots gives a particu- 
larly strong constraint, because we can apply it twice for any pair of distinct 
roots, o and 8. Defining the SU (2) algebra with Ea gives 

o: 


1 
ar -30 —q). (6.37) 


Defining the SU (2) algebra with Eg gives 


: 1 ; 
Ces F(a), (6.38) 


Multiplying these gives a remarkable formula for the angle 055 between the 
roots a and f: 


259, (a:b) _ (p- v a). (6.39) 


What is remarkable about this is that (p — q)(p' — q') must be an integer, so 
(because it must be non-negative) there are only four interesting possibilities 
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(up to complements) for angles between roots! 


u 


(p — a)(» —q) Bop 
0 90° 
1 60° or 120° (6.40) 
2 45° or 135° 
3 30° or 150° 


The possibility (p — g)(p' — q') = 4, corresponds to 0° or 180° — neither 
is interesting. 0° is already ruled out by our theorem on uniqueness. 180° 
is trivial because roots always come in pairs with opposite signs, both in the 
same SU (2) subgroup. 


Problems 


6.A. Show that [Ea, Ea] must be proportional to Egg. What happens 
if a + B is not a root? 


6.B. Suppose that the raising lowering operators of some Lie algebra 


satisfy 
[Ea Eg] = N Eo+6 


for some nonzero N. Calculate 


[Ea, E-a-ß ] 
6.C. Consider the simple Lie algebra formed by the ten matrices: 
Ta aTi 99373 72 


for a = 1 to 3 where ca and 7, are Pauli matrices in orthogonal spaces (see 
problem 3.E). Take Hı = o3 and H2 = 0373 as the Cartan subalgebra. Find 
(a) the weights of the four dimensional representation generated by these 
matrices, and 

(b) the weights of the adjoint representation. 

Hint: Although you have enough information to do the problem after read- 
ing this chapter, it may be easier after you have seen the example of SU (3) 
worked out in the next chapter. 


Chapter 7 


SU(3) 


After SU (2) the most important algebra in particle physics is SU (3). Maybe 
it is more important. I'm not sure. SU(3) is the group of 3 x 3 unitary 
matrices with determinant 1 (again, as in (3.30), the U stands for “unitary” 
and the S stands for "special", which means determinant 1). 


7.1 The Gell-Mann matrices 
SU (3) is generated by the 3 x 3 hermitian, traceless matrices. There are vari- 


ous ways of seeing that the tracelessness constraint is what gives determinant 
1. If we exponentiate the hermitian generators to get unitary matrices 


U (a) = etka (7.1) 
we can compute the determinant in any basis. In particular, if we diagonalize 
Qa Xa; 

VæaXıaV =D (7.2) 


where D is diagonal, we have 


det(U(a)) = det(e’?) = [pe = eh WD — gi Tracke (7.3) 
J 


Thus if Tr a, X, = 0, the determinant is 1. 
The standard basis for the hermitian 3 x 3 matrices in the physics litera- 
ture is in terms of a generalization of the Pauli matrices, called the Gell-Mann 
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matrices: 


a 

Il 
ATTN 
or © 
oo coc eX 
oo O 
ae” 

> 

a © 

n Il 
P dE 
os © 


1 

| 

0 

0 (7.4) 
] 

0 


Doomoo 
ot ee — l 


0 
À7 = 0 
0 


These are generalizations of the Pauli matrices in the sense that the first three 
Gell-Mann matrices contain the Pauli matrices acting on a subspace: 


wets D) pea tes (7.5) 


You can imagine that we could go on and construct 4 x 4 matrices that contain 
these 3 x 3 matrices in the same way, and so on. 
The SU(3) generators are conventionally defined by 


1 
T oA (7.6) 
2 
and they satisfy 
1 
Tr (ToT) = 59 (7.7) 


Clearly, T, for a = 1 to 3 generate an SU(2) subgroup of SU (3). 
This is sometimes called the isospin subgroup, for reasons that will become 
apparent when we discuss SU (3) as an approximate symmetry of the strong 
interactions — where this subgroup is in fact Heisenberg’s isospin. It is con- 
venient to put 735 in the Cartan subalgebra. There is one generator, Tg, that 
commutes with T3, so we put it in the Cartan subalgebra as well, and take 


H=  Ho-Ts (7.8) 
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7.2 Weights and roots of SU (3) 


The weights of this representation are easy to find because 73 and Tg are 
already diagonal 


Lu) 0 1 0 0 

2 

n= (i -i | ma F(a 1 | (7.9) 
0.0 0 


The eigenvectors, and associated weights are 


1 
(o) — (1/2, V3/6) 
0 


0 
B — (-1/2, V3/6) (7.10) 
0 


0 
(e) — (0, -V3/3) 
1 


These vectors, plotted in a plane, form the vertices of an equilateral triangle 


(—1/2, V/3/6) e e (1/2, V3/6) 
Hı > 


(7.11) 


(0, —- 3/3) 


The roots are going to be differences of weights, because the corresponding 
generators must take us from one weight to another. It is not hard to see that 
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the corresponding generators are those that have only one off-diagonal entry: 


1 : 

Va (Ti + 1T2) = Exipo 
1 . 

vg (T + iT) = Baspavsp (7.12) 
1 ; 

V2 (Te x iTr) = E51 /2,4/3/2 


where the + signs are correlated. The roots form a regular hexagon, plotted 
here along with the two elements of the Cartan subalgebra in the center: 


t 


(7.13) 


Problems 


7.A. Calculate f147 and f4sg in SU (3). 


7.B. Show that T1, T5 and T3 generate an SU (2) subalgebra of SU (3). 
Every representation of SU (3) must also be a representation of the subalge- 
bra. However, the irreducible representations of SU (3) are not necessarily 
irreducible under the subalgebra. How does the the representation generated 
by the Gell-Mann matrices transform under this subalgebra. That is, reduce, 
if necessary, the three dimensional representation into representations which 
are irreducible under the subalgebra and state which irreducible representa- 
tions appear in the reduction. Then answer the same question for the adjoint 
representation of SU (3). 
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7.C. Show that Az, A5 and A; generate an SU (2) subalgebra of SU (3). 
Every representation of SU(3) must also be a representation of the subalge- 
bra. However, the irreducible representations of SU(3) are not necessarily 
irreducible under the subalgebra. How does the representation generated by 
the Gell-Mann matrices transform under this subalgebra. That is, reduce, 
if necessary, the three dimensional representation into representations which 
are irreducible under the subalgebra and state which irreducible representa- 
tions appear in the reduction. Then answer the same question for the adjoint 
representation of SU(3). 


Chapter 8 


Simple Roots 


What we need to complete the analogy between SU (2) and an arbitrary sim- 
ple Lie algebra is a notion of positivity for the weights. Then we can dis- 
cuss things like raising and lowering operators, and the "highest weight" in 
a meaningful way. What we want is a definition that ensures that every non- 
zero weight is either positive or negative, and that if p is positive, —p is 
negative and vice versa. 


8.1 Positive weights 


It is easy to find such a scheme — indeed, in a multi-dimensional space, there 
are an infinite number. In some arbitrary basis for the Cartan subalgebra, the 
components, 441, H2, - - ., of the weight are fixed. We will say that the weight 
is positive if its first non-zero component is positive and that the weight is 
negative if its first non-zero component is negative. While this depends on 
the arbitrary basis, it does have the properties we want. Eventually, we will 
see that the results will not depend on the basis, but for now, we will just fix 
it and forget it. 


For example, in SU (3), the 3 dimensional defining representation looks 
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like this: 


(1/2, V/3/6) 


is positive 
H 1 


(—1/2, V/3/6) 


is negative 


(8.1) 


(0, —1/V/3) 


is negative 


The weight (0, —1/./3) is negative because its first component is zero so the 
sign is determined by the sign of the second component. 
With this definition, we can define an ordering in the obvious way: 


p > vif p — v is positive (8.2) 


This allows us to talk about the highest weight in a representation. 

In the adjoint representation, the positive roots correspond to raising op- 
erators and the negative roots to lowering operators. The highest weight of 
any representation has the property that we cannot raise it, so that all genera- 
tors corresponding to positive roots must annihilate the corresponding state. 

In the SU(3) adjoint representation, in our usual basis, the positive roots 
are on the right and the negative on the left, as shown below: 


T 
Hz 
°. °. 
negative | positive 


positive 
H 17 


negative 


(8.3) 


negative ositive 
ga p e 
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8.2 Simple roots 


But we don't want to check all the roots if we don't have to. Clearly, some 
of the positive roots can be built out of others. So it makes sense to define 
simple roots as positive roots that cannot be written as a sum of other positive 
roots. We will then see that if a weight is annihilated by the generators of all 
the simple roots, it is the highest weight of an irreducible representation. Fur- 
thermore, from the geometry of the simple roots, it is possible to reconstruct 
the whole algebra. The logic of this is fun and worth understanding in detail. 
1 — If a and f are different simple roots, then o — f is not a root. This is 
true because one of them, say f, is larger, so that G — a is positive. But then 
B is the sum of two positive roots, o and B — a. 

2 — Because a — £ is not a root 


E_q|Eg) = E-g|Ea) = 0 (8.4) 
Then in the master formula (6.36) 


a-B_ 1 


o2 -35(P = q) 
the integer q is zero. Similarly in 
B-a 1 
ua 7 -50 -d) 
the integer g’ is zero. Thus 
a. p p-a p 
Q3 gp o 


Knowing the integers p and p’ for each simple root is equivalent to knowing 
the angles between the simple roots, and their relative lengths because 


/vyry 2 
cos fag = E 5 = : (8.6) 


3 — The angle between any pair of simple roots satisfies 


T 

3 EU wm (8.7) 
The first inequality follows from (8.6) because the cosine is less than or equal 
to zero. The second inequality follows because all the roots are positive. Sim- 
ple multidimensional geometry then implies that the simple roots are linearly 


106 CHAPTER 8. SIMPLE ROOTS 


independent. Here's a proof — consider a linear combination of the simple 
roots, 


y= >> raa (8.8) 


If all the coefficients have the same sign, then clearly, y cannot vanish unless 
all the coefficients vanish, because the a are all positive vectors. But if there 
are some coefficients of each sign, we can write 


y-pu-v (8.9) 
where u and v are strictly positive vectors, 
p= »» BQO v = — 5 IQQ (8.10) 
ra 0 za «0 


But the norm of y cannot vanish because 
(u — vy! =p? +0 -2(u-v) > H> (8.11) 


where the last inequality follows from the fact (a - 8) < 0 for any pair of 
simple roots, (8.6). 

Thus no linear combination of the simple roots can vanish and they are 
linearly independent. 
4 — Any positive root ¢ can be written as a linear combination of simple 
roots with non-negative integer coefficients, ka 


$= M kso (8.12) 


This is just logic. If $ is simple, this is true. If not, we can split it into two 
positive roots and try again. 

5 — The simple roots are not only linearly independent, they are complete, 
so the number of simple roots is equal to m, the rank of the algebra, the 
number of Cartan generators. If this were not true, then there would be some 
vector £ orthogonal to all the simple roots, and therefore orthogonal to all the 


roots. But then 
[[: H, Eg] =0 forall roots ó (8.13) 


Since £- H also commutes with the other Cartan generators, it commutes with 
all the generators and the algebra is not simple, contrary to assumption. 

6 — Finally, we are in a position to construct the whole algebra from the 
simple roots. For now, we will simply show how to determine all the roots. 
We will find easier ways of doing this later, and also discuss how to construct 
the actual algebra. 
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We know that all the positive roots have the form 
Pk = >) kaa (8.14) 
Q 


for non-negative integers, ko, where the integer k is 


k=} ka (8.15) 


If we can determine which ¢, are roots, we will have determined the roots in 
terms of the simple roots. It is straightforward to do this inductively, using 
the master formula. 

All the $4's are roots because these are just the simple roots themselves. 
Suppose now we have determined the roots for k < £. Then we look at 


Eal¢e) (8.16) 


for all œ, which gives roots of the form ¢g41. We can compute 


2a ft LL (p d) (8.17) 

But we will always know q, because we will know the history of how ¢¢ got 

built up by the action of the raising operators from smaller k. Thus we can 
determine p. If p > 0, then ¢g + e is a root. 

Let's illustrate this inductive procedure for £ = 1. In this case, we always 

start with a simple root, $; = 8 where £ is a simple root. All the gs are zero 


SO 
2a-¢,  2a-B — 
a Qt 
Thus if o - 8 = 0, then p = 0 and a + £ is not a root. Otherwise p > 0 and 
a+ B is a root. 

The only way this procedure could fail to find a root is if there exists 
some positive root $;,4 which is not the sum of a root ¢g and some simple 
root. This is impossible, because if there were such a $544, it would be an- 
nihilated by all the E... (because E $41) if non-zero would be a ¢¢ state 
and we could apply E, to it and get |dg11) back). Thus |¢¢41) would have to 
transform like the lowest weight state of all the SU (2) subalgebras associated 
with the simple roots, which requires that the E; values a - $g41/a* < 0 for 
all o. But then 


—p (8.18) 


piyi = 5 kaa- be41 <0 (8.19) 
a 
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which is a contradiction. Thus we always find all the roots $;,.; by acting on 
all the ¢g with all the simple roots. For SU (3), for example, the positive root 
(1,0) is the sum of the other two, which are the simple roots 


a! = (1/2, V3/2) o? = (1/2, -V3/2) (8.20) 


as shown 
(8.21) 
We have 
(8.22) 
thus Os oe 
2a°-@ 2a°-a 
"E = EY = —1 (8.23) 


and thus p = 1 for both a! acting on [o?), and vice versa. Thus a! + o? is a 
root, but 2a! + o? and a! + 20? are not. 


8.3 Constructing the algebra 


The procedure outlined above can give us more than a listing of the roots. We 
can actually construct the entire algebra from the simple roots. Let us go back 
to the derivation of the master formula, where we found in (6.31) and (6.34) 


a: : Q-H : 
OE p=j — S] (8.24) 
Q 


This follows because a state |u) with weight y in any irreducible representa- 
tion must have some component that transforms under the largest spin repre- 
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sentation of the SU (2) algebra associated with a, generated by (from (6.20)) 


E* = |o| 1 Eza ds; 


Ej = |a| ?oa. H 


This largest spin is the 7 in (8.24). In general, |) may also have lower spin 
components, but j is the highest one. It must be there so that (E+)? |p) 
transforms like a |j, j} state, which is non-zero, but annihilated by another 
E*, and (E-)" ju) transforms like a |j, —7) state, annihilated by another 
E. Adding these two relations gives the master equation. But subtracting 
them gives 

pt+q=2j (8.26) 


Thus if we know p and q, we know the highest spin representation that over- 
laps with the weight state. 

If p is a root in the adjoint representation, the situation is even simpler. 
Because we have already shown that each root appears only once in the ad- 
joint representation, if we know p and q, and therefore j for a root under 
the action of the SU (2) associated with a simple root a, we can conclude that 
|G) is the state with E3 = a - 8/o? in the spin j representation, 


l8) = lo: 8/o?) (8.27) 
It is completely determined up to a phase. Thus we also know exactly how 


E+ acts, up to phases. Let's see how this enables us to construct the algebra 
in the example of SU (3). The root diagram looks like this: 


(8.28) 
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where 
2 2 
at ux =1 olo? = —1/2 
al. o? al- Q? 1 
al? o2? 2 


We already know that p = 1 and q = 0 for both a! acting on |), and vice 
versa. We already know how the Cartan generators commute with everything. 
So we just need the commutation relations of the raising and lowering oper- 
ators. Let's begin by explicitly constructing the raising operator, E,1,,42. 
Since p -- q — 1, we have j — 1/2 and therefore, if we look at the a! raising 
operator 


E*|E,2) = la! Ej |Eg2) 
= Eu|Ej) = [Egi Zeal) 


1 (8.29) 
ae 1/2) 


glo ea?) 


-E,|1/2, -1/2) = 


where 7 is a phase. This may need some explanation. The |E,) state is 
properly normalized, by assumption, and it corresponds to the |1/2, —1/2) 
state under the o? SU (2) (the E; value is o? o? Jat? = —1/2). Acting on it 
with the raising operator E* tells us what the correctly normalized |1/2, 1/2) 
state is. But this, up to a phase, which we called 7, must be the properly 
normalized state | E: , 42). Putting all this together, and choosing 7 = 1 by 
convention, we have 


\Balia2) = V2\[Eai, Ey2]) (8.30) 


and thus 


Exi4a2 = V2[Eq; Ea] (8.31) 


Now that we have expressed the other positive root as a commutator of 
the simple roots, we can compute any commutator just using the Jacobi iden- 
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tity. For example 
[Eoi , Eia] 
= V2[E 4, [Ej Ea] 
= V2[[E a, Eai], Eaz] (8.32) 
= 2 [-2' - H, Ea] 
1 
= —V2a! i o^ Eg: a ya a? 


We already knew this, because we are just moving back down the SU(2) 
representation. Here’s another, slightly more interesting: 


[E 42, Egi 192] 


= V2[E s, (Eo, Ea] 


(8.33) 
=v2 [Ears =Q H| 
EST la? -H, Ea] 

= V2a! -o? Ey: = -Ea 


The interesting thing here is the phase — which is determined to be a — sign. 


8.4 Dynkin diagrams 


A Dynkin diagram is a short-hand notation for writing down the simple roots. 
Each simple root is indicated by an open circle. Pairs of circles are connected 
by lines, depending on the angle between the pair of roots to which the circles 
correspond, as follows: 


CED ifthe angle is 150° (8.34) 
C&O ifthe angle is 135? 
Q-Q  ifthe angle is 120? 


© © ifthe angle is 90? 
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The Dynkin diagram determines all the angles between pairs of simple roots. 
This doesn't quite fix the roots, because there may be more than one choice 


for the relative lengths. We will come back to that later. 
Q is the diagram for SU (2) 


Q-—QO  isthe diagram for SU(3) 


8.5 Example: G3 
Suppose that an algebra has simple roots 
al = (0,1) a? = (v3/2, —3/2) 


This is an allowed pairing, because 


Mit "2g 
al-aq? = —3/2 
201 a Ll 2a! a gii 
al a? 
The angle between the two roots is determined by 
COS Ü,ig2 = E bataz = 150° 


Thus this corresponds to the Dynkin diagram 


CEO 
This algebra is called G2. 


8.6 The roots of G2 


(8.35) 


(8.36) 


(8.37) 


(8.38) 


For E,: acting on |o?) we have p = 3. For E,2 acting on |o!) we have 


p = 1. Thus 
olco? 2o!-o? and 3a! +Q? 


are all roots but 
al +20? and 4o! +a? 


(8.39) 


(8.40) 
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are not. In terms of ¢, in (8.14) we have 
dy =a! + a? $$ = 2a! + o? d4 = 3a! + a? (8.41) 


We know that the ¢ state is unique from the general properties of simple 
roots. The $3 state is unique because a? + 207, the only other state that could 
be obtained by acting on $», is not a root. To see whether there is another $4 
state, we must check whether 2a! + 20? is a root, that is, whether it can be 
obtained by acting on ¢3 with a simple root, which must be a. 
20?.(200 +a?) -6+6 _ 
o2? RENE 

But q = 0 because 2a! is not a root, and thus p = 0, so 2a! + 2a? is not a 
root. Actually, we could have come to this conclusion more simply by noting 
that 2a! + 2a? = 2(a! + a”) which is twice the root a! + a”, but we proved 
in the discussion after (6.26) that no multiple of a root can be a root. 

Now to get the ¢ states, note that we already know that 4o + o? is not 
a root, so we need only check 3a! + 2a?. 


0 — -(p — q) (8.42) 


2a? - (3a! +a?) -9+6 _ 

o2? uU mS 

Again, q = 0, thus p = 1 and ¢5 = 3a! + 2a? is a root. Because p = 1, we 

also know that 3a! + 30? is not a root, so to check for $e, we need only look 
at 4a! + 202. 

2a! - (3a! +207) 6-6 


al? 1 


-1 = -(p- q) (8.43) 


q = 0 because 2a! + 2a? is not a root, so we are finished (again, we could 
has just used the fact that 2a! + o? is a root to see that 4a! + 2a? is not), and 
the roots look like this: 


—o?e ®3a! + a? 


—a! — ae e 2a! +Q? 


janka (8.45) 


—3a! — ae 
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What we did can be summarized in the following diagram: 


(8.46) 


8.7 The Cartan matrix 


There is a useful way of keeping track of the integers p* and q associated 
with the action of a simple root o? on a state |¢) for a positive root $ that 
eliminates the need for tedious Ecomemeal calculations. The idea is to label 
the roots directly by their g^ — p’ values. The gq’ — p' of any weight, p, is 
simply twice its E4 value, where E; is the Cartan generator of the SU (2) 
associated with the simple root a’, because 


2H 2u- of te 
z |i) = lu) = (d - p) la). (8.47) 
Q 


2Es|u) = 


Because the a! are complete and linearly independent, the q* — p* values for 
the weights contain the same information as the values of the components of 
the weight vector, so we can use them to label the weights. The advantage of 
doing so is that it will make very transparent the structure of the representa- 
tions under the SU (2)s generated by the simple roots. 

Since a positive root, ¢, can be written as @ = 2: kjo? , the master 
formula can be written as 


NM ES 
q -P i2 


;2 (8.48) 
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where A is the Cartan matrix 


A. = 2a? -a 
ji = j2 


ot 


(8.49) 


The matrix element A,; is the q — p value for the simple root a’ acting on the 
state |o7), twice the Es value, thus all the entries of A are integers. The di- 
agonal entries are all equal to 2. This is obvious from the definition, and also 
from the fact that the simple roots have E3 = 1 because the SU (2) generators 
themselves transform like the adjoint, spin 1 representation. The off-diagonal 
entries, all 0, —1, —2 or —3, record the angles between simple roots and their 
relative lengths — the same information as the Dynkin diagram, and they tell 
us how the simple roots fit into representations of the SU (2)s associated 
with the other simple roots. It is easy to see that the Cartan matrix is invert- 
ible because the o7 are complete and linearly independent. Note that the jth 
row of the Cartan matrix consists of the q; — p; values of the simple root 
a, 
For SU (3), the Cartan matrix looks like 


2 —1 
(2 2 ) (8.50) 
For the G2 algebra we have just analyzed, it looks like 
2 -1 
( -3 2 ) (8.51) 


8.8 Finding all the roots 


We now show how to use the Cartan matrix to simplify the procedure of 
building up all the roots from the simple roots. When we go from ¢ to ¢+ o? 
by the action of the raising operator Eej, this just changes k; to kj + 1, and 
thus g’ — p! to ¢ — p' + Aj. 


kj — kj +1 
CO (8.52) 
-=p >g -p + Aji 


If we think of the qt — p as the elements of a row vector, this is equivalent to 
simply adding the jth row of the Cartan matrix, which is just the vector q — p 
associated with the simple root o7. This allows us to streamline the process 
of computing the roots. We will describe the procedure and illustrate it first 
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for SU (3). Start with the simple roots in the q — p notation. We will put each 
in a rectangular box, and arrange them on a horizontal line, which represents 
the k = 1 layer of positive roots — that is the simple roots. 


k= at, o i53 


It is convenient to put a box with 7n zeros, representing the Cartan generators, 
on a line below, representing the k = 0 layer. 


ki at, a? 859 
k-0 H; 


Now for each element of each box we know the q’ value. For the ith element 
of a’, ¢ = 2, because the root is part of the SU(2) spin 1 representation 
consisting of E,4; and o? - H. For all the other elements, g? = 0, because 
a’ — af is not a root. 


q=2 0 0 2 


1 ol, o? (8.55) 
k=0 H; 


Thus we can compute the corresponding p’. 


k 


il 


P=0 1 1 0 


k=1 ala? (856 
k=0 H 


Since the ith element of a’ is 2 (because it is a diagonal element of A), the 
corresponding p! is zero (of course, since 2a’ in not a root). For all the 
others, p is just minus the entry. For each non-zero p, we draw a line from 
the simple root to a new root with k — 2, on a horizontal line above the 
k — 1 line, obtained by adding the appropriate simple root. The line starts 
above the appropriate entry, so you can keep track of which root got added 
automatically. You can also draw such lines from the k = 0 layer to the k = 1 
layer, and the lines for each root will have a different angle. You then try to 
put the boxes on the k — 2 layer so that the lines associated with each root 
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have the same angle they did between the 0 and 1 layer. These lines represent 
the action of the SU (2) raising and lowering operators. 


P= 00 
q= 11 
k=2 
= (8.57) 
k=0 


The procedure is now easy to iterate, because everything you need to know 
to go from k = £ to k = £ + 1 is there in your diagram. At each stage, you 
compute p by subtracting the element of the vector from the corresponding q. 
For SU (3), the procedure terminates at k — 2, because all the ps are zero. 

Clearly, we could have continued this diagram farther down and shown 
the negative roots in the same way. 


k=2 
kel 
k=0 (8.58) 
k=-1 
k=-2 


8.9 The SU(2)s 


The transformation properties of the roots under the two SU(2)s should be 
obvious from (8.58). In fact, instead of thinking about p and q, we can just 
see how each E; value fits into an SU (2) representation. Then the process 
terminates as soon as all the SU(2) representations are completed. This is 
equivalent to actually computing the ps and qs, because we got the master 
formula by thinking about this SU (2) structure anyway, but it is much faster 
and easier. 
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Let us illustrate this with the diagram for G2: 


k=5 
k=4 
k=3 
k=2 
k=1 (8.59) 
k=0 


The argument in terms of the SU (2) structure goes as follows. We know 
that the simple roots a? = and a? = are the highest weights of 
spin 1 representations for their respective SU (2)s. must be the bottom 
of a doublet under a”, because of the —1, and is the bottom of a spin 
3/2 quartet under a+ because of the —3. So we just follow these up to the 


end, making sure that each root fits happily into representations under all the 
SU (2)s. is fine because it is the top of the o? doublet and in the a! 


quartet. is fine because it is an o? singlet and in the o! quartet. 
finishes the a! quartet and starts a new o? doublet. And finally, finishes 
the a? doublet and is an a! singlet, so it doesn't start anything new, and the 
process terminates. 


8.10 Constructing the G2 algebra 


We will do another example of this procedure later. Now let's stay with G2 
and construct the algebra. The two relevant raising operators are 


1 
Ef=E,. —Ej-——E (8.60) 


Start with the | £,2) state. We know, because p = 3 and q = 0 or by looking at 
the roots in (8.59) that it is the lowest weight state in a spin 3/2 representation 
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of the a! SU (2). Call it 
|Ea2) = 13/2, -3/2, 1) (8.61) 
Then applying the a! raising operator 
[Ea , E,2)) 
3 
= "HA —1/2,1) (8.62) 
3 
= 2 | Egi 402) 
The last line is a definition, in that it involves a phase choice, but the con- 
struction using the raising operator guarantees that we have the normalization 


right. Applying the raising operator again gives 


I[Eoi ; [Eoi , Ea2]]) 
- 2/2 13/2, 1/2,1) (8.63) 
= v6 | Fog 42) 


and a third time gives 


l[Ea [Eat [Ea Eal) 
= Vsvéis a, 1) 
=3 |Eso1 402) 

= 3|1/2, —1/2,2) 


(8.64) 


where we have written the last line because this is also the lowest weight state 
of spin 1/2 representation of the o? SU(2). Then applying the o? raising 
operator gives 


[[Ea2; [Ea [Eo ? [Ea , E23] 


1 
= EN. 11/2, 1/2, 2) (8.65) 
= 3 |E: 1 2) 

V6 3a! +2a 


Putting this all together, we have expressions for all the positive roots in 
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terms of the generators associated with the simple roots: 


š ; E 
Ezar ta = E Bats Eat, Ea? 
2al +Q? = 6 Ea: (£a: £a? (8.66) 


E3q1402 = 3 [Ea > [Ea E (Ea: , E,2)]] 
F3o14202 = > ru [Bas [E, al: [Ei , Eg2]]]] 
This is enough to determine all the commutation relations by just repeatedly 


using the Jacobi identity. For example, let's check that the a! lowering oper- 
ator acts as we expect. For example 


(E —als Ea! | 
= y3 Ee [Eats o2]] 
3 2 (E os E Ej] , Eo] (8.67) 


2 
- d lo- H, E] 
2 
= NM o? Eg = fÈ Bee 


This is what we expect for a lowering operator acting on |3/2, —1/2) state. 
The other relations can be found similarly. The general form involves some 
multiple commutator of negative simple root generators with a multiple com- 
mutator of positive simple root generators. When these are rearranged using 
the Jacobi identity, the positive and negative generators always eat one an- 
other in pairs, so that in the end you get one of the positive root states, or one 
of the negative root states or a Cartan generator, so the algebra closes. 


8.11 Another example: the algebra C3 


Let's look at the algebra corresponding to the following Dynkin diagram 


1 2 3 
565 m 
where " 2 g 
al =a =1 meer) (8.69) 
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2 -1 0 
—1 2 -1 (8.70) 
0 -2 2 


Then the construction of the positive roots goes as follows:! 


The Cartan matrix ts 


(8.71) 


This algebra is called C3. It is part of four infinite families of simple 
compact Lie algebras, called the classical groups. We will discuss them all 
later. 


8.12 Fundamental weights 


Suppose that the simple roots of some simple Lie algebra are o? for j = 
1 to m. The highest weight, 4, of an arbitrary irreducible representation, D 
has the property that u + ¢ is not a weight for any positive root, ¢. From the 
preceding discussion, it is clearly sufficient that 1 + af not be a weight in the 
representation for any 7, because then 


Exile) 20 Vj (8.72) 


which implies that all positive roots annihilate the state, because any positive 
root can be expressed as a multiple commutator of simple roots. We will 
see soon that this is an if and only if statement, because an entire irreducible 


! Here, because the group is rank 3, we are projecting a three dimensional root space onto 
two dimensions — for groups of higher rank, these diagrams can get quite busy. 
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representation can be constructed by applying lowering operators to any state 
with this property. So if (8.72) is true, then y is the highest weight of an 
irreducible representation. This means that for every E; acting on |u), p = 
0, and thus l 

RI RAD £ (8.73) 


where the £7 are non-negative integers. Because the o/s are linearly indepen- 
dent, the integers £7 completely determine p. Every set of £7 gives a u which 
is the highest weight of some irreducible representation. 

Thus the irreducible representations of a rank rn simple Lie algebra can 
be labeled by a set of m non-negative integers, 2’. These integers are some- 
times called the Dynkin coefficients. 

It is useful to consider the weight vectors, ju), satisfying 

20) . u* 
5— = Ójk (8.74) 


aJ 


Every highest weight, u, can be uniquely written as 
m H H 
w= >) Ow (8.75) 
j=l 


We can therefore build the representation with highest weight u by construct- 
ing a tensor product of £! representation of highest weight u}, 7? with highest 
weight u2, and so on. This representation will generally be reducible, but we 
can always then pick out the representation yz by applying lowering operators 
to |n). 

The vectors jj? are called the fundamental weights and the m irreducible 
representations that have these as highest weights are called the fundamental 
representations. We will sometimes denote them by DJ. Remember that 
the superscripts are just labels. The vectors also have vector indices. It's 
potentially confusing because both run from 1 to m. 

There is more to say about the Dynkin coefficients. Since the fundamen- 
tal weights form a complete set, we can expand any weight of any representa- 
tion in terms of them, as in (8.75). Then we can run the argument backwards 
and get (8.73) which implies that for a general jz, 


p -g-p (8.76) 


that is Æ is the q? — p) value for the simple root af. Thus the matrix elements 
of the vectors we were manipulating in constructing the positive roots of var- 
ious algebras were just the Dynkin coefficients of the roots (though of course, 


8.13. THE TRACE OF A GENERATOR 123 


for a general weight or root, the Dynkin coefficients will not necessarily be 
positive). In particular, the highest box in the construction is just the highest 
weight in the adjoint representation. The rows of the Cartan matrix are the 
Dynkin coefficients of the simple roots. Later, we will use a similar analysis 
to discuss arbitrary representations. 


8.13 The trace of a generator 


This is a good place to prove a theorem that we will see often in examples, 
and that will play a crucial role in the discussion of unification of forces: 


Theorem 8.9 The trace of any generator of any representation of a compact 
simple Lie group is zero. 


Proof: It suffices to prove this in the standard basis that we have developed 
in chapter 6 and this chapter, because the trace is invariant under similarity 
transformations. In the weight basis, every generator is a linear combination 
of Cartan generators and raising and lowering operators. The trace of raising 
or lowering operators is zero because they have no diagonal elements. The 
Cartan generators can be written as linear combinations of à - H, because 
the simple roots, G^, are complete. But each a? - H is proportional to the 
generator of an SU (2) subalgebra and its trace is zero because every SU (2) 
representation is symmetrical about 0 — the spin runs from —7 to j. Thus 
the Cartan generators have zero trace and the theorem is proved. 


Problems 


8.A. Find the simple roots and fundamental weights and the Dynkin di- 
agram for the algebra discussed in problem (6.C). 


8.B. Consider the algebra generated by og and canı where oq and rj, are 
independent Pauli matrices. Show that this algebra generates a group which is 
semisimple but not simple. Nevertheless, you can define simple roots. What 
does the Dynkin diagram look like? 


8.C. Consider the algebra corresponding to the following Dynkin dia- 
gram 
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where 
32 
a 


2 2 
al =? L2 =] 


Note that this is similar to C3 in (8.68), but the lengths (and relative 
lengths) are different. Find the Cartan matrix and find the Dynkin coefficients 
of all of the positive roots, using the diagrammatic construction described in 
this chapter. Don’t forget to put the lines in the right place — this will make 
it harder to get confused. 


Chapter 9 


More SU (3) 


In this chapter, we study other irreducible representations of SU (3) and in 
the process, learn some useful general things about the irreducible represen- 
tations of Lie algebras. 


9.1 Fundamental representations of SU (3) 


Label the SU (3) simple roots as 


at = (1/2, V/3/2) 


o? = (1/2, —V/3/2) fem 


Then we find the yit; the highest weights of the fundamental representations, 
by requiring that x’ - o? = 0 if x j 


p* = (a, t) 
ui - o) = (di F V3 b")/2 (9.2) 
—a-2zx3V 


and then normalizing to satisfy (8.74) which gives 


pt = (1/2, ¥3/6) 


g? = (1/2,-v3/6) i 
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ul is the highest weight of the defining representation generated by the Gell- 
Mann matrices — 


(—1/2, V3/6) © e (1/2, /3/6) 
Hi > 


(0, —/3/3) 


We can also write it in the q — p notation. In fact, we can use the properties of 
the lowering operators in the q—p notation to construct all weights by working 
down from the highest weight. jz! is associated with the vector (1,0), and in 
the graphical notation in which the simple roots look like 


(9.4) 


it looks like this 
(9.5) 


The highest weight is the top of an a! doublet. Subtracting o! gives E 


= [-11] thus because the second entry is a 1, and there is no state 


above it in the o? direction, it is the top of an o? doublet. Subtracting o? 
gives [|-11]- = [0 - 1] so the series terminates and the weights are 


m pl — ol pl — ol — o? (9.6) 
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There are many ways of obtaining the representation 1/2, or (0,1). The 
most straightforward way to construct it from scratch is to use the q — p 
notation, starting with the highest weight, and working downwards in the 
SU (2) representations: 


(9.7) 
so the weights are 
n m Lg? we -aol (9.8) 
or 
(1/2,-V3/6) (0, 3/3) (-1/2,—V3/6) (9.9) 
so in the H; plane, these look like 
T 
Hə 
(9.10) 
Hj — 
e e 


9.) Constructing the states 


Do all of these weights correspond to unique states so that the representation 
is 3 dimensional? We strongly suspect so, since that was the case for the (1,0) 
representation, and these two are clearly closely related. But we would like to 
be able to answer the question in general, so let's see how we might answer it 
for a general representation. Suppose we have an irreducible representation 
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with highest weight u. Then there will be one or more states, |u), and all the 
states in the representation can clearly be written in the form 


Eg, Es, dus Es, lu) (9.11) 


where the $; are any roots. But any such state with a positive @ can be 
dropped, because if a positive $ appears, we can move it to the right us- 
ing the commutation relations until it acts on |), which give zero. So we can 
take all the $; in (9.11) to be negative. 

But since every negative root is a sum over the simple roots with non- 
positive integer coefficients, we can take only states of the form 


E oA E ab "US E abn m (9.12) 


where af: are simple roots without losing any of the states. 

Now it is clear if it wasn’t before that the highest weight state is unique. 
If there were two states with the highest weight, the representation would 
break up into two independent sets of the form (9.12) above, and it would not 
be irreducible. 

In addition, any state that can be obtained in only one way by the action 
of simple lowering operators on |u} is unique. This shows that all the states 
in the (0,1) representation are unique. 

This technique of starting with the highest weight state and builiding the 
other states by the action of lowering operators actually allows us to build 
the representation explicitly. We can compute the norms of the states (see 
problem 9.A) and the action of the raising and lowering operators on the 
states is built in by construction. We will not actually use this to build the 
representation matrices, because there are usually easier ways, but we could 
do it if we wanted to. The point is that all we need to do to construct the 
representation explicitly is to understand the structure of the space spanned 
by the kets in (9.12). 

The first thing to notice is that two states produced by a different set of 
E_,i8 in (9.12) are orthogonal. This is clear because the linear independence 
of the os implies that the two states have different weights. But then they 
are orthogonal because they are eigenstates of the hermitian Cartan generators 
with different eigenvalues. 

The norm of any state can be easily computed using the commutation 
relations of the simple root generators. The norm has the form 


NTIPOAER dis E£? E E A E 2 S E an lu) (9.13) 


The raising operators, starting with Æ „s, can be successively moved to the 
right until they annihilate |i). On the way, they commute with the lowering 
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operators, unless the roots are a positive and negative pair, in which case 
the commutator is a linear combination of Cartan generators, which can be 
evaluated in terms of p and the afis, leaving a matrix element with one less 
raising and lowering operator. This is a tedious, but completely systematic 
procedure — you can easily write a recursive program to do it explicitly. 

If two states in (9.12) have the same set of (js but in a different order, 
then they have the same weight. In this case, we must compute their norms 
and matrix elements to understand the structure of the space. The matrix 
elements can be calculated in the same way as the norm in (9.13) because 
they have a similar structure: 


lu|Egu o En Eon E a E cas ES unen) (9.14) 


where y; and £j are two lists of the same set of integers but in a different 
order. Again, simply moving the raising operators to the right gives a com- 
pletely systematic way of computing the matrix element. 

Once you have the norms and the matrix elements, you can construct an 
orthonormal basis in the Hilbert space using the Gram-Schmidt procedure. 
For example, suppose that one weight space involves two states of the form 
(9.12), | A) and |B). Then if 


(A | B): (B | A) = (A | A) - (B | B) (9.15) 


the two states are linearly dependent, and there is a single basis state that we 


can take to be 
|A) 


Rc. (9.16) 
V (414) 
While if 
(A| B)-(B| A) # (A| A) -(B] B) (9.17) 


then the two states are linearly independent and we can choose an orthnormal 
basis of the form 


|A) 


—= and 
y (A | A) 


|B)(A | A) —|A)(A | B) (9.18) 


((A| A))-((A| B)- (B | A) - (A | A) - (B | B)) 


An orthonormal basis is all you need to complete the construction of the 
space. Again, I want to emphasize that while this procedure is tedious and 
boring, it is completely systematic. You could program your computer to do 
it, and thus construct any representation completely from the highest weight 
and the simple roots. 
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9.3 The Weyl group 


We could also have shown that the states in the u? representation are not 
degenerate by understanding the obvious symmetry of the representations. 
The symmetry arises because there is an SU (2) associated with each root 
direction, and all SU (2) representations are symmetrical under the reflection 
E3 — —Es. In particular, if u is a weight, and E3 = a- H/ a? is the E3 
associated with the root o, then 


Q . 
Eslu) = Fl) (9.19) 
and the reflection symmetry implies that u — (q — p)o (where (q — p) = 
2(a - u/o?)) is a weight with the opposite Es value 


Es|u — (q — p)a) = -2f le- (q — p)o) (9.20) 


Note that the vector average of u and its reflection, u — (q — p)o, is u — (q — 
p)a/2 which is in the hyperplane perpendicular to a, because 


a- (p — (q — p)a/2) =a- u — (q — p)o?/2 — 0 (9.21) 
as illustrated below: 


u-—(q-p)a p—(q-p)a/2 H 


(9.22) 
Q 


All such reflections for all roots are transformations on the weight space 
that leave the roots unchanged. We can obtain additional transformations 
that leave the roots unchanged by combining two or more such reflections. ! 
The set of all transformations obtained in this way forms a transformation 
group called the Weyl group of the algebra. The individual reflections are 
called Weyl reflections. The Weyl group is a simple way of understanding 
the hexagonal and triangular structures that appear in SU (3) representations. 


lFor example, in SU(3), if you first reflect in the plane perpendicular to o and then in 
the plane perpendicular to a”, the result in a rotation by 120°. 
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9.4 Complex conjugation 


Notice that the weights of the second fundamental representation are just the 
negatives of the weights of the first. 


(9.23) 


This means that the two representations are related by complex conjugation. 
If T, are the generators of some representation, D, of some Lie Algebra, 
the objects —T* have the same commutation relations (because the structure 
constants are real) — 


(I5 mp es dod." 
m , Tř] =-tf, bol, (9.24) 
[-Ti, m] = i fore(-Tr) 


It is called the complex conjugate of the representation D, and is some- 
times denoted by D. D is said to be a real representation if it is equivalent 
to its complex conjugate. If not, it is a complex representation. The Cartan 
generators of the complex conjugate representation are —H;*. Because H; 
is hermitian, H;* has the same eigenvalues as H;. Thus if is a weight in 
D, —p is a weight in D. In particular, the highest weight of D is minus the 
lowest weight of D. 


Obviously, in SU (3), the representation (0,1) is the complex conjugate 
of (1,0). The lowest weight of (1,0) is —”, the negative of the highest weight 


132 CHAPTER 9. MORE SU(3) 


of (0,1) and vice versa. 


(9.25) 


There are various different notations used in the physics literature for 
the SU (3) representations. Sometimes, you will see the Dynkin coefficients 
(q1, 92). This is always a unique labeling. But a common shorthand notation, 
which is not too confusing for small representations is to give their dimension 
and to distinguish between a representation and its complex conjugate with a 
bar, so that (for example), 


1,023 (0123 (9.26) 
In general, the complex conjugate of (n, m) is (m, n). This is clear if 
you think about the highest and lowest weights. Because the lowest weight 
of (1,0) is minus the highest weight of (0, 1), and vice versa, we have 
(n,m) has highest weight nu! + my? 
: EN MEE 
(n, m) E: lowest weight nu^ — mp (9.27) 
(m,n) has highest weight nu? + mpl 


Representations of the form (n, n) are real. 


9.5 Examples of other representations 
Consider the representation (2,0). It has highest weight 


20! = (5 1/v3) (9.28) 
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In the q — p notation, of (9.4) and (9.5) it looks like this 


(9.29) 


or in the Cartan space 


(9.30) 


All the weights correspond to unique states. 25! — a1, 2u! — 2a! and 
2u! — a! — o? are reached by unique paths from the highest weight, and the 
others are related to these by Weyl reflections. Thus this is a 6-dimension 
representation — (2,0) = 6. It is evidently complex. Its complex conjugate, 
(0, 2) = 6 is inverted in the origin in the Cartan plane, and reflected about a 


vertical line in the q — p picture: 
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(9.31) 
(9.32) 

Now consider the representation (1,1). Note that 
pl +R ol +a? (9.33) 


but a} +0? is the highest weight of the adjoint representation (8.58) We know 
that the zero weight is doubly degenerate — corresponding to the two states 
of the Cartan subalgebra. The two ways of getting to zero from the highest 
weights give different states. You can show that 


Eg E 2|! T R?) X E q2E m T p) (9.34) 


You will do this in problem (9.A). 
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Now look at (3,0), with highest weight 34! 
(9.35) 


The weight diagram is 


(9.36) 


3u! — 3a! 


All the states are obviously unique except the state in the center. But it 
turns out to be unique also. The two vectors 


E 4E E 3n!) 


(9.37) 
E 4E 4E q|3ul) 
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are linearly dependent, because we can write 


E 2E QE Qu |3p!!) 

= [Eaz Bo] E-a 3H) 
+E_o1 E_92E_o1|3p') 

= OE [E_o2, E-o] 134") 
+E_ i E_.2E_91|3p") 

= E_ yt E_.2E_91|3p") 
+E_o1 E_g2E_g1|3p") 

= 2E_ 1 E_g2E_q1|3p') 


(9.38) 


2 


where the second step follows because —2a! — a? is not a root, and thus 


[E 42, E qi], E-a] =0 (9.39) 


2 


and the second to last because 3u! — o? is not a root, and thus 


[E 42, E] |3u') = E 42 E |31l) (9.40) 
Thus this is a ten dimensional representation. It is sometimes called just 
the 10. Its complex conjugate, the (0, 3) representation, is the 10. 


Problems 


9.A. If |x) is the state of the highest weight (u = u! + u?) of the adjoint 
representation of SU (3), show that the states 


|A) = Eœ E-a? |u) 


and 
|B) = E_g2 E-a! m 


are linearly independent. Hint: Calculate the matrix elements (A | A), (A | 
B), and (B | B). Show that | A) and | B) are linearly dependent if and only if 


(A | A(B| B) = (A | B)(B | A) 


9.5. EXAMPLES OF OTHER REPRESENTATIONS 137 


9.B. Consider the following matrices defined in the six dimensional ten- 
sor product space of the SU (3) Aq matrices and the Pauli matrices c5: 


1 
2^472 for a — 1,3,4,6 and 8 
1 
3^« for a = 2,5 and 7 
Show that these generate a reducible representation and reduce it. 


9.C. Decompose the tensor product of 3 x 3 using highest weight tech- 
niques. 


Chapter 10 


Tensor Methods 


Tensors are great tools for doing practical calculations in SU (3) and in many 
other groups as well. As you will see, the idea of tensors is closely related to 
the idea of a wave function in quantum mechanics. 


10.1 lower and upper indices 


The idea starts by labeling the states of the 3 as 


1/2, V3/6) = |) 
[-1/2, V3/6) = |;) (10.1) 
|o, —1/V3) = |g) 


The 1, 2 and 3 are to remind you of the fact that the eigenvectors of the Hı 
and H, matrices corresponding to these weights are vectors with a single 
non-zero entry in the first, second and third position. We have also written 
the indices below the line for a reason that will become clear shortly. 
If we define a set of matrices with one upper and one lower index, as 
follows 1 
[Te]; = z Aalis (10.2) 


then the triplet of states, |;}, transforms under the algebra as 


T end (10.3) 


The important thing to notice is that the sum over 7 involves one upper and 
one lower index. 
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Label the states of the 3 as 


|-1/2, - 3/6) =|’) 


11/2, -V3/6) = |?) (10.4) 
l0, 1/V3) = |?) 
Then l l 
Tal) = - la]; (10.5) 


This is true because the 3 is the complex conjugate representation, generated 
by 
-[r7] = -[T7} = -Ta (10.6) 


Now I can define, as usual, a state in the tensor product of n 3s and m 3s. 


nones = py. mp4 (10.7) 
It transforms as 
diem 
T.I 5) 
S iiim k 
= z [s deat! Hol (10.8) 
—ıki m L 
=o ihe) m 
£—1 


This distinction between upper and lower indices is useful because SU (3) 
has two different kinds of 3-dimensional representations — the 3 and 3. We 
need some way to distinguish them. Raising and lowering the indices is just 
a handy notational device. 


10.2 Tensor components and wave functions 


Now consider an arbitrary state in this tensor product space 


jy aue (10.9) 


Jin 1m 


v is called a tensor. A tensor is just a “wave-function”, because we can find 
v by taking the matrix element of |v) with the tensor product state. 


dio epp») (10.10) 


Ui. m Ji In 
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The correspondence here is exactly like the relation between the space wave 
function of a particle in quantum mechanics and the state describing the par- 
ticle in the Hilbert space. 


p(z) = (z|v) (10.11) 


The tensor v is characterized by its tensor components, vi n, 2. Now 


we can think of the action of the generators on |v) as an action on the tensor 
components, as follows: 


Talo) = |Tav}) (10.12) 
where 
To 2e 
Z k 
= (T. a " ex. 
m (10.13) 
ex 


pL Ev Jija 
alig Ui kim 


=1 


This defines the action of the generators on an arbitary tensor! 


10.3 Irreducible representations and symmetry 


We can now use the highest weight procedure to pick out the states in the ten- 
sor product corresponding to the irreducible representation (n,m). Because 
|,) is the highest weight of the (1,0) representation, and ^) is the highest 
weight of the (0,1) representation, the state with highest weight in (n, m) is 


nie (10.14) 
It corresponds to the tensor vg with components 
vg]. p= = NójiÓ41042:70i42 (10.15) 


Now we can construct all the states in (n, m) by acting on the tensor vg 
with lowering operators. The important point is that vy has two properties 
that are preserved by the transformation vy — T,vy. 


1. vy is symmetric in the upper indices, and symmetric in the lower in- 
dices. 


2. uy Satisfies 
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à vy) 20 (10.16) 


Tim 

The first is preserved because the generators act the same way on all the upper 
indices and the same way on all the lower indices. The second is preserved 
because of the minus sign in (10.13) and the tracelessness of the Tas. All the 
states in (n,m) therefore correspond to tensors of this form, symmetric in 
both upper and lower indices, and traceless (which just means that it satisfies 
(10.16) which is a generalization of the condition of tracelessness for a ma- 
trix). It turns out that the correspondence also goes the other way. Every such 
tensor gives a state in (n, m). 


10.4 Invariant tensors 


The oa is called an invariant tensor. An invariant tensor is one that does 
not change under an SU (3) transformation. The change in a tensor is propor- 
tional to the action of some linear combination of generators on it, but 


[Tad] = (Tali, 9* — (Ta]* 6}, = [Tah — [Ta]; = 0 (10.17) 


thus Pa doesn’t change under any SU (3) transformation. There are two other 


invariant tensors in SU (3) — the completely antisymmetric tensors, c? * and 
€ijk- These are invariant because of the tracelessness of Ta. Consider 


[Tn e[7* = [Tae e* + eA + [Talk e?* (10.18) 
This is completely antisymmetric, so we can look at the 123 component — 


[Ta die = [EC c tu [1:12 c8 E nay c 


zc 1 123 T2123 3 123 . (10.13) 
= [Ta] E + [Talge + [Talge* = 0 


Thus et% is invariant. 


10.5 Clebsch-Gordan decomposition 


We can use tensors to decompose tensor products explicitly. Suppose that u 
is a tensor with n upper indices and m lower indices, and v is a tensor with 
p upper indices and q lower indices. Then it follows from the definition of a 
tensor that the product, u & v defined by the product of the tensor components 


4 t 


dede yir in p (10.20) 


i-i 

1 

jj jej 4j Ua suat 
mJ, Ja m Ji Jq 


[u & v] 
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is the tensor that describes the tensor product state |u) & |v). Thus we can 
analyze tensor products by manipulating tensors — the wave-functions of the 
corresponding states. The general strategy for doing these decompositions 
is to make irreducible representations out of the product of tensors, and then 
express the original product as a sum of terms proportional to various irre- 
ducible combinations. The advantage of this procedure is that we are directly 
manipulating wave functions, which is often what we want to know about. 

Consider, for example, 3 & 3. If we have two 3s, ut and vÍ, we can write 
the product as 


wo = (ui v e uf vt) + ze eremu v (10.21) 


The first term, ' 
5 (ui vitul vi) (10.22) 


transforms like a 6. This contains the highest weight state ul yl. We have 
added the v? v’ term to make it completely symmetric in the two upper in- 
dices, and thus irreducible. The lower index object 


€t ut v™ (10.23) 


having only one lower index, transforms like a 3. Thus we have explicitly 
decomposed the tensor product into a sum of 6 and a 3. Not only does this 
show that 

39326903 (10.24) 


is (1,0) & (1,0) = (2,0) & (0, 1) (10.25) 


it shows us how to actually build wave functions with the required symmetry 
properties. Later, we will see how this makes some kinds of calculations easy. 
Note also how as in (10.23) whenever a tensor with more than one upper 
index is not completely symmetric, we can trade two upper indices for one 
lower index using the e tensor. 
Next, let's look at 3 & 3, a product of u! (a 3) and v; (a 3). We can write 


: ; . TU 
u'vj— (v vj — 35 u* ve) + 35 uf uk (10.26) 


The first term in parentheses is traceless, and transforms like the 8, while the 
tensor with no indices, u*v,, transforms like the trivial representation, (0,0), 
or 1. Thus 

393-8061 (10.27) 
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Or 
(1,0) & (0,1) = (1,1) e (0,0) (10.28) 


Notice the role of the invariant tensor, 5. One way of understanding why 
only traceless tensors are irreducible is that if a tensor is not traceless, the ô 
tensor can be used to construct a non-zero tensor with fewer indices, and thus 
explicitly reduce it. . 

One more example — ut (a 3) times v; (an 8). 


uw = 

1 ij i $44 1 VEM AC 

5 (u v} +u vk — -0k u vy — 49, u UP 

s (10.29) 
tae (€emn U” vg + Ekmn U” vg) 


1 : ` ; . 
*g (33 uf v) — ut ve) 
where the first term on the right hand side is a (2,1) (or 15 — it’s 15 dimen- 
sional), the second term is a 6, and the last is a 3. 


3982150603. (10.30) 


Or 
(1,0) & (1,1) = (2,1) e (0,2) & (1,0). (10.31) 


10.6 Triality 


Notice that (n — m) mod 3 is conserved in these tensor product decomposi- 
tions. This is true because the invariant tensors all have (n—m) mod 3 = 0, 
so there is no way to change this quantity. It is called triality. 


10.7 Matrix elements and operators 


The bra state (v| is 
(v| = oft f^ " (aim (10.32) 


Wim  Urjn 
The bra transforms under the algebra with an extra minus sign. For example, 
the triplet (;| transforms as 


—(\Ta = —GlTal Gl = — Ursi (| (10.33) 
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In words, this says that the bra with a lower index transforms as if it had an 
upper index. This is because of the complex conjugation involved in going 
from a bra to a ket. Similarly, the bra with an upper index transforms as if it 
has a lower index. This suggests that we define the tensor corresponding to a 
bra state with the upper and lower indices interchanged so that the contraction 
of indices works in (10.33). Thus we say that the tensor components of the 
bra tensor (v| are 


viim = piirin * (10.34) 
so that (10.32) becomes 
CEN Ge (1035) 


Then when the state (v| is transformed by — (v|T;, the tensor 7 is transformed 
by Tav. 
For example, consider the matrix element 
_ kiek ejn skieekmji i 
(ulo) = apte vide ncm) 
L hike jija sii., gim sli... gln 
=U Ub on k ig C (10.36) 


£14 diim 


The indices are all repeated and summed over (contracted for short), which 
they must be because the matrix element is invariant. 


10.8 Normalization 


A corollary to (10.36) is that if the state |v) is normalized, satisfying (v | 
v) = 1, then the tensor components must satisfy a normalization condition 


5c quac e (10.37) 
ivan 


For example, the tensor vz in (10.15) satisfies (10.37), because only a single 
term contributes in the sum. But a tensor of the form 


N(26161 — 6162 — 6263) (10.38) 
describes a state with norm 
NEDERL- 6468 — 468)? = wr D +D +T) 2er 


jk 
(10.39) 
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10.9 Tensor operators 


We can extend the concept of tensors to include the coefficients of tensor 
operators in an obvious way. For example, if a set of operators, O, transform 
according to the (n,m) representation of SU(3), we can form the general 
linear combination 


W = wi" oiim (10.40) 


115m J1'7Jn 
then if W transforms by commutation with the generators, the coefficients, 
w1!'7^. transform like (n, m), (10.13). 


ü “dm? 


10.10 The dimension of (n,m) 


A very simple application of tensor methods is the calculation of the dimen- 
sion of the irreducible representation (n,m). The dimension of the repre- 
sentation is the number of independent components in the tensor. We know 
that the tensor has n upper and m lower indices, and is separately symmetric 
in each. The number of independent components of a object symmetric in 
n indices each of which run from 1 to 3 is equal to the number of way of 
separating n identical objects with two identical partitions — which is 


(10.41) 


es — (n2). (n+2)(n +1) 
2 )- nl2!1 — 2 


Thus if there were no other constraint, the number of independent coefficients 
would be 

(n + 2)(n +1) (m+ 2)(m 4 1) 

2 2 

However, the tensor is also required to be traceless. This says that the object 
we get by taking the trace vanishes, and it is symmetric in n—1 upper indices 
and m —1 lower indices. Thus this imposes B(n—1, m—1) constraints, so the 
total is 


B(n,m) = (10.42) 


D(n,m) = B(n,m) — B(n — 1, m — 1) 
_ (n+2)(n +1) (m  2)(m +1) 

2 
a D i (10.43) 
_ (nF Dm Dn e 2-2) - nm] 


x EET EE ET 
2 
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You can check that this formula works for the small irreducible representa- 
tions that we have discussed. 


10.11 * The weights of (n, m) 


We can now put together various pieces of what we know about the irre- 
ducible representation (n,m) of SU (3). A general irreducible representation 
of SU (3), (n, m), has the form of a hexagon, illustrated below for n = 8 and 
m= 4. 


m41 
\ weights 


! ni 
€ e o œo o è è > Vejghts 


(10.44) 
The highest weight is marked with the arrow. The weights along the dotted 
line can be reached from the highest weight by repeated application of the 
lowering operator E ,:. Thus counting the highest weight, there are n + 1 
weights along the dotted side of the hexagon. The weights along the dashed 
line can be reached from the highest weight by repeated application of the 
lowering operator E 2. Thus counting the highest weight, there are m + 1 
weights along the dashed side of the hexagon. If either n or m is O, the 
hexagon degenerates in to a triangle. 


The Wey] reflection symmetries guarantee that the hexagon is symmetri- 
cal. Thus the three dashed sides are equivalent in the diagram below, as are 
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the three dotted sides. 


N 

/ e » m+] 

\ weights 
» 


B e e e e e e e e e e $ (10.45) 


a ; nl 
e e e e e e e . weights 


The dashed and dotted sides together make up an outer layer of weights, all of 
which correspond to unique states in the irreducible representation because 
they are equivalent by Weyl reflection to states that can be reached uniquely 
by simple lowering operators from the highest weight. Obviously, we can 
define a next layer of weights by considering the hexagon immediately inside 
this one, and we can continue this process until each weight is assigned to a 
“layer”. 


We will now show how to prove the following result: 


Theorem 10.10 As you go in from one layer to the next, the degeneracy of 
the weight space for states in the layer increases by one each time, until you 
get to a triangular layer. From then on, the degeneracy remains constant. 


Theorem 10.10 implies that the degeneracies of the layers in our example are 
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as shown in the following figure. 


© BE © P e e. e a E E. (10.46) 


We will prove theorem 10.10 by counting the independent components of 
the tensors which represent these states. So we must first understand in detail 
how to go from weights to tensor components. We begin by considering ten- 
sors without the constraint of tracelessness. Such a tensor is simply the most 
general object completely symmetric in its upper and lower indices. We will 
then use the strategy of the previous section (on the dimension of (n, m)) to 
deal with the trace constraint. The advantage of this, as we saw above, is that 
it is easy to count the independent states because the tensor components are 
completely determined by the number of upper and lower components of each 
value. Now consider a tensor symmetric in n upper indices and in m lower 
indices. This corresponds to a reducible SU (3) representation, but it con- 
tains the general irreducible representation (n, m) that we wish to study. The 
weight diagram for this reducible representation will be the same as (10.44), 
but the degeneracies of the states corresponding to each weight may be dif- 
ferent (and larger). The outer layer, however, is nondegenerate even for this 
more general tensor, because it contains no tensor components which have the 
same value of any upper and lower index, so there is no trace constraint. As 
we go around this outer layer, along each side, one index changes to another 
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as illustrated in the following figure (where J indicates a lower j index). 
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(10.47) 
To deal with the inner layers, we first need to understand the connec- 
tion between the tensor components and the weights. Consider a particular 
component of the general tensor with n upper and m lower indices. We will 
assume, for convenience that n > m (as is the case for the example in the 
figures). This is not a serious restriction, because if it is not true for a rep- 
resentation, then it is true for the complex conjugate representation obtained 
by changing the signs of all the weights (with all degeneracies remaining un- 
changed). Thus if we understand the representations for n > m, we actually 
understand them all. Define z(7) to be the number of upper components with 
the value j, and Z(j) to be the number of lower components with the value 7. 
Thus for example, for the highest weight of our tensor, z(1) = n, 2(2) = m, 

and the others are all zero. Note that the three quantities 


z(1) — z(1), z(2) —z(2), z(3) — z(3), (10.48) 


are determined by the weight associated with the component and the value of 
n — m. By straightforward calculation, the 1-component of the weight vector 
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satisfies 


l 


z 1 Z 
ii = 3PC!) — 2(1)] - 562) - 2(2)]. (10.49) 


The 2-component of the weight vector satisfies 


y3 


Bia) - za) + Sere) - za] - Y 


"E [2(3) = 2(3)]. (10.50) 


And n — m satisfies 
n =m = [z(1) — z(1)] + [2(2) — z(2)] + [2(3) — z(3)]. (10.51) 


Thus 


2(1) ~2(1) = sla + 2V3 u2 + 2(n — m) 
2(2) -z(2) = ER 4-28 us + 2(n — m] (10.52) 
(3) - x) = sl(n — m) - 2V3 u] 


Actually, rather than using (10.52), it will be easier just to follow what is 
happening to the z(7) — Z(7) by thinking about what happens to the indices 
as you move around the weight diagram, but the important point is that these 
differences of upper and lower indices are all fixed. 


Now suppose that we move into the k-th inner layer by taking k steps to 
the left. Assume that k < m so we are still in a hexagonal layer (the number 
of weights along the upper-right-hand edge of the layer decreases by one for 
each increase in k, thus k = m is the corner of the first triangular layer — we 
will come back to this later). Then take 7 steps down to a weight on the k-th 


10.11. * THE WEIGHTS OF (N, M) 151 


inner layer, as shown below (for k = 3 and 7 = 2): 


(10.53) 


As we move in horizontally, in each step either 1 — 2, or 2 — 1, or 
1 — 3 and 2 — 3 (see (10.47), and thus z(1) — z(1) is decreased by one and 
z(2) — z(2) is increased by one. Similarly, as we go down the kth layer, at 
each step z(1) — z(1) is decreased by one and z(3) — z(3) is increased by 
one. And thus 


z(2) — z(2) = — (m — k) (10.54) 
z(3) - z(3) = j 


Now we can count the number of components. (10.54) implies that the 
tensor component can be written with 


n — k — j 1s, m — k 2s, j 3s, and k j-j pairs, where (10.55) 
jis1,20r3. 


The j-j pairs do not contribute to the z(j) — z(j) values —they do not affect 
the weight, but they give different independent tensor components. Thus the 
number of ways we can get this weight is the number of different ways we 
can choose these Kk pairs, which is 

es _ (k+2)(k+1) 


3 (10.56) 
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Note that it only depends on & — that is it only depends on the layer, not 
on where we are in the layer. It is easy to see that this is also true for the 
weights above the horizontal. The argument is similar and we leave it as an 
exercise for the reader. (10.56) is the number of independent components for 
the general tensor with n upper and m lower components. The number of 
components in the traceless tensor is this minus the number of independent 
components of the same weight in the trace. The trace is a general tensor 
with n — 1 upper and m — 1 lower indices. It's highest weight is one step 
to the left of the highest weight of (n,m). From the highest weight of this 
tensor, it takes only k — 1 steps to get to the same weight. Thus the number 
of components of the trace is (10.56) for k — k — 1, and the number of 
independent components of the traceless tensor is 


(F25-(1- Graben E uus (10.57) 


This simply implies that the degeneracy increases by one each time we move 
in a layer, in agreement with theorem 10.10. 

This analysis remains valid in to k = m. But when k > m, that is after 
the first triangular layer, (10.55) doesn't make sense because the number of 
2s cannot be negative. Instead, the component must look like 


n —k-— j 1s, k — m 2s, j 3s, and m j-j pairs, (10.58) 
where 7 is 1, 2 or 3. 


Thus from this layer on, m plays the role of k, and the degeneracy remains at 
m + 1 the rest of the way in. This completes the proof of theorem 10.10. 


10.12 Generalization of Wigner-Eckart 


Consider a matrix element of a tensor operator between tensor states 
(u|W |v) (10.59) 


This is really a shorthand for a very large number of possible matrix ele- 
ments, because each representation may have many independent components 
(the number being just the dimension of the representation). Suppose that 
the representations of v, u and W are irreducible. Then we expect that the 
symmetry will ensure that there are relations among the various possible ma- 
trix elements. For example, if these were SU(2) tensor operators, we know 
that all the matrix elements would be determined in terms of a single num- 
ber. To determine the consequences of the symmetry in this case, we imag- 
ine decomposing the state W |v} into irreducible representations. Then we 
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know there can be no contribution to the matrix element except from repre- 
sentations that transform under the same irreducible representation as u. This 
follows from Shur's lemma. Also because of Shur's lemma, the contribution 
from each appearance of the representation of u is determined by a single 
unknown number, because the matrix element is proportional to the identity 
in the space of the u representation. Thus there will be one (and only one) 
unknown constant in the matrix element for each time the representation of u 
appears in the tensor product W & v. However, there is an important differ- 
ence between this and the case of SU (2). Here u may appear more than once 
in the decomposition of W |v). 

Tensor analysis will make it relatively easy to write an expression for the 
matrix element, (10.59), that automatically incorporates all of the constraints 
that follow from the symmetry. Because of (10.9), (10.35), and (10.40), the 
matrix element will be proportional to the tensor components of |v) 


uo (10.60) 
and of W 
Wire (10.61) 


and proportional to the conjugate components of (u| 
Ti r (10.62) 


The entire matrix element must be invariant. Which means that the answer 
for the matrix element must have all the indices either contracted with each 
other or with invariant tensors (that is ds and es). Another way to say the 
same thing is that the matrix element 

Mot! ] - 

(1 n! | £A Adm) 
A3 kika lirin 
e Aims 
— tink kath d, 


(10.63) 


must itself be an invariant tensor — a sum of terms made of ds and es. 

Here is an example that will be of some interest later. Suppose that u, W 
and v are all 8s. This is particularly easy to analyze, because u, W and v can 
all be thought of as matrices, with the upper index referring to the row, and 
the lower index to the column. Then, for example, 


Wivi = [W v] (10.64) 


154 CHAPTER 10. TENSOR METHODS 


Now a term with all indices contracted is one in which the matrices are com- 
bined into a number by matrix multiplication, and taking traces. There are 
precisely two such terms: 


Trí(uW v) and Tr(uvW) (10.65) 


There are two terms because the tensor product 8 & 8 contains 8 twice. Thus 
in this case the matrix element is 


M Tr(UW v) + Ag Tr(uv W) (10.66) 


where A; and Àz are constants that must be fixed by the physics, rather than 
the group theory (like the reduced matrix elements in the Wigner-Eckart the- 
orem for SU(2)). This means that all 8? = 512 matrix elements are de- 
termined in terms of these two parameters! Furthermore, we know how all 
the matrix elements are related simply by matrix multiplication, without any 
fancy tables or whatnot. 

Here's another example — suppose u and v are 10s, and W is an 8. The 
matrix element will be proportional to the tensor components of |v) 


yiiizis (10.67) 


and of W 
wk (10.68) 


and proportional to the conjugate components of (u| 


Uji jojs (10.69) 


In this case, it is easy to see that there is only one way to put the indices 
together (because of the total symmetry of u and v) 


Tiik WE virt (10.70) 
Thus the matrix element is determined by a single reduced matrix element 


(u|W |v) = Ai uj 4 We vit (10.71) 


10.13 * Tensors for SU (2) 


You may be wondering why we have spent so much time on the description 
oí tensors for SU (3), while we barely mentioned them for SU(2). The an- 
swer to this question is really more history and sociology than it is math or 
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physics. The description of irreducible SU (2) representations in term of ten- 
sors is simply not used very much, perhaps because it is not needed. Everyone 
knows how to deal with the SU (2) irreducible representations in other ways. 
Nevertheless, tensor analysis is useful for SU (2), and it may be instructive to 
see how it works. 

Tensor analysis for SU (2) is even simpler than for SU (3) because there 
is only one fundamental representation for SU(2) — the doublet. Thus we 
can look only at tensors with upper indices, and there is no trace constraint. 
The € symbol now has two indices, rather than three in SU(3). So while 
we could use the invariant e tensor to lower two upper indices in SU(3), 
in SU(2), the € just eats two upper indices completely. This implies that 
the irreducible representations correspond to completely symmetric tensors, 
because if a tensor with n indices is not completely symmetric, then we can 
get a non-zero tensor with n —2 indices by contracting two indices with e, and 
that would reduce the representation. Obviously, the completely symmetric 
tensor with n indices corresponds to the irreducible spin-n/2 representation, 
because the highest weight state, in which all the indices are 1, has J3 — n/2. 

Because the SU (2) irreducible representations correspond to such sim- 
ple tensors, we can write them down explicitly. Consider the tensor with 2s 
indices, corresponding to the spin-s representation, and let us pick out the 
component with J3 = m. Because each 1 index carries J3 = 1/2 and each 2 


index carries J3 = —1/2, the J3 = m state must correspond to a tensor with 
S +m 1s and s — m 2s. Let us define the tensor 
vit du (10.72) 


to be a completely symmetric tensor which is equal to 1 if s + m indices are 
1 and zero otherwise. There are 


( e ) E OEN (10.73) 


Stm) (s 4- m)! (s —m)! 


ways to pick the s +m 1s out of the 2s indices. Thus vs is a sum of ( 2) 


terms. For example, 
vitis = OF OP ai + St OS? Of? + oh 5? of (10.74) 
The state |vsm) is not properly normalized. The squared norm gets a 
contribution | from each of the ( 25 ) terms in (10.72). Thus the properly 


stm 
normalized state is 


2s \ 712 
|s, m) = ER [Us m} (10.75) 
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It is easy to check that this definition behaves properly under the SU (2) rais- 
ing and lowering operators. We will next illustrate its use. 


10.14 * Clebsch-Gordan coefficients from tensors 


Consider the product [Ne "T 
Pn m (10.76) 
From the general properties of tensors, we know that it transforms like a 


component in the tensor product of s; and s2. The corresponding normalized 
state is easy to write down, because (10.76) is simply a product: 


25, —1/2 259 -1/2 
31,52, m, mz) = " qm (s Lr) I"si mi ` Vaza) 
(10.77) 


The tensor product (10.76) lives in the space of tensors which are sym- 
metric in the 2s; i indices and separately symmetric in the 2s2 7 indices. 
What we would like to do is to decompose this tensor product space into sub- 
spaces that transform irreducibly under SU (2). We can do this by combining 
tensors corresponding to the various spin states in s; ® s2 with invariant ten- 
sors to produce tensors with the appropriate symmetry properties. 

The simplest example of this is spin s1 + s2. We know that we can write 
the states of spin sı + s2 in terms of a completely symmetric tensor with 
281 + 239 indices, 


231 + 239 


1/2 
10.78 
81 à: rE |Us; 52, mima) ( ) 


[51 - 52, m4 - m3) = ( 
Since v, 15;,m m; is completely symmetric, it is symmetric in its first 25; 
and its last 252 indices, and is thus in the same space as the tensor product, 
(10.76). The scalar product of the two states, (10.77) and (10.78) is the an- 
swer to the question “What is the amplitude to find the state of spin sı + 52 in 
the tensor product state?" In other words, it is the Clebsch-Gordan coefficient, 


(51 + $2, M1 + M2 | S1, S2, M1; m3) (10.79) 
But we can also easily calculate the matrix element directly from the defini- 


tions because 


(Usi +s2,m1+m2 | Usi ' Us; ma) 
" r . d * H H H 3 
aha 2177225,J1773259 231171251 J1'7J2s9 
= X (eni Us,mi  * Us2,me (10.80) 
ipi28 
j1:773255 
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The complex conjugation is unnecessary, because the tensors are real. The 
sum looks formidable, but actually it is easy to do. Each configuration of 1s 
and 2s for the tensors on the right-hand-side will correspond to some config- 
uration for v5;,..25,, Which contains all possible configurations with the right 
total number of 1s and 2s. Thus the sum in (10.80) gives 


( 281 Y( 232 ) (10.81) 
$1 T 1i $9 + m» í 


and the result for the Clebsch-Gordan coefficient is 


(51 + s2, m3 + meg | $1, $2, M1, M2) 


28 1/2 289 1/2 281 + 252 = (10.82) 
> eae Ore eee 


You can check in special cases that this is right using raising and lowering 
operators, but tensors gave us the general result rather easily. 


10.15 * Spin sı + s2 — 1 


The component of the tensor product, (10.77), in spin sı + s2 — 1 will be 
proportional to the tensor Vs +s3—1,mı+mz. 1o compute the Clebsch-Gordan 
coefficient, we must find the normalized state proportional to this tensor in the 
space of the tensor product tensor, (10.76). That is, we must construct a tensor 
proportional to v;,..55—1,m; -m;, but with 2s; + 2s2 indices and symmetric 
in the first 25, indices, and the last 2s2 indices. To add two indices, we need 
the invariant tensor €. Thus we can form the tensor 


117 12771251 2777255 
ET ee tees (10.83) 
Now we have to symmetrize in the 7 indices and symmetrize in the 7 indices. 
Let us call the symmetrized tensor 


pit isiu jasa (10.84) 


81--52—1,m; +M2,281,252 


Each term in Y has a single e*s multiplied by vs, +s.—1,m,+m» With all the 
other indices. There are 45,52 terms in V, corresponding to the 2s; possible 
values of r and the 255 values of s. 
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The tensor Y is not properly normalized. Call the normalization factor 
N. Then the Clebsch-Gordan coefficient is 


(81 + $9 — 1, mı + ma | $1, 52, M1, M2) 


N 251 x 235 -1/2 
i £ A oa (10.85) 


pit tas Jes * 11125, 177259 
5 $14-52—1,m1-4-m7n2,251,252 Usi,m1 Us5,ma 
i25 
J1' 73259 


The sum in (10.85) is easy to simplify because each of the 45,52 terms in V 
gives the same result. This is true because these terms were constructed by 
symmetrizing in the and j indices, but the tensor product is already symmet- 
ric in the 7 and j indices. And as before, the complex conjugation is irrelevant. 
Thus we can rewrite (10.85) as 


(31 + 89 — 1l, mı + m» | $1, $2, m1, M2) 


2 —1/2 —1/2 
Gs) gg) e 
$1 +m, 82 + m2 


tigi , 22°28, f2 J2sg | $28, | 71703283 
yi € Us, 459—-1,mj+me2 'Üsimi * Use,me 


=1/2 —1/2 
=N 251 ) ( 282 ) deis 
$1 - mi $9 + m» 

[( sı— 1 Ms Cees | 289 — 1 ) 
$1-mj —1 $9 4- m3 sı +m, 89 4- m» —]1 
(10.86) 
where the last line follows by simply expanding the two non-zero values of 


c1 jn the sum to get 


i27425,j2:7j259 J. tg-t25) pode 3223 = V 12128, pide dese 
$17-52—1,mj--ma $1,mj $2,m2 $1,mj $2,m2 
12771281 
j2:73259 


(10.87) 
To calculate N in (10.85), note that we can expand one of the Vs, and 
each of the terms will give the same result because of the symmetry of the 


remaining Y, so 


IND EC 


yh 7712511773253 
$1+52—1,m1+m2,281,252 


dba: Bee (10.88) 
B üh 1271251 2777255 $1:71251717772s29 
= 48182 ps € Us) +s2—1,mi1+m2 Vs1+s2—1,mi Tm2,251,252 

ty vizs] 
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Now we can do the sum over 7, and 7j, in (10.88) by expanding V. 
What makes this doable is the fact that 
irja y; 71281 19289 ES 12:125; J2773255 
p» : V5i--52— mi -ma,251,252 = C Ysits3=1m, +m (10.89) 
î1,J1 
for some constant C, so. that 
iiji,/4277025172773255 pitts Jas 
De s Us) +s2—Lmi+me “s1+s2—1,mitme,25) 252 
ipi284 


j1'1255 

= C Y 12:125, 277255 2 "Er C 23, + 235 had 2 

Ti Usipso-l,mima| — sı +s2— 1 +m +m 
gigs, 
32773255 


(10.90) 
To compute C, note that the term in Y proportional to €?!/! gives a factor 


of 2 because " 
iS jenn 


i11 


=2 (10.91) 


The terms proportional to etim for m # 1 give 1 each because 


Y, e eim a iiim (10.92) 
i 
The sum over jı then simply reproduces v5.15! 1 m 12, Since there are 
235 — 1 possible values for Jm, such terms give 2s2 — 1 total contribution to 
C. The terms proportional to €'"! for m Æ 1 contribute another 2s; — 1. The 
other terms contribute nothing. Thus C = (24-25; — 14-255 — 1) = 2s1--255, 
and 


(10.93) 


2 285 — 2 
1/N? = 4 $1s89(2s; + 282) ( Sin ) 


81 +89 — 1+ mi +m 


and putting all this together gives 


(81-59 — l,m, + mg | 81, $2,771, m3) 


( 281 =e 282 SR 
i p ae 


( 281 Dan 2 P^ (225 1/2 
81d $9 —1-c- m; +m +s 
E (nc 1 


31 
d enis ees x 
$8; +m,—-1 $9 + m» 81 + mi 89 +m, — 1 


(10.94) 
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10.16 * Spin sı + s2 — k 


The analysis of the general case is similar to that in the previous section. The 
total spin can be written as 5; + 52 — k for k between 0 and |s; — s2|. The 
component of the tensor product, (10.77), in spin 5; + 52 — k is proportional 
to the tensor vs, 45.—k,mi-+m2- To compute the Clebsch-Gordan coefficient, 
we must find the normalized state proportional to this tensor in the space of 
the tensor product tensor, (10.76), so we must construct a tensor proportional 
tO Us, 155 -k, m, +m2> but with 25; + 252 indices and symmetric in the first 25; 
indices, and the last 252 indices. For this, we need k invariant tensors e^? to 
form the tensor 

ein E s nens aba (10.95) 
Now we symmetrize in the 7 indices and symmetrize in the j indices. The 
symmetrized tensor is 


yin rae; (10.96) 


81-F52—k,rn1 -m2,251,252 


Each term in V has k e*s multiplied by vs, +s2—k,mı+mz With the other in- 
dices. We can count the number of terms in Y as follows. The index in the 
first e can have any of 25; values, and the k index any of 252 values. Then the 
i index in the second e can have any of 25; — 1 values, and the k index any of 
259 — 1 values. And so on for k terms. However, this overcounts by a factor 
of k!, because the order of the es doesn’t matter. Thus the number of terms is 


(241)! (253)! 1 
P = Qs; — k)! (255 — k)! k! ao) 


Again, let N be the normalization factor required to properly normalize 
the tensor V. Then the Clebsch-Gordan coefficient is 


(81 + s2 — k, mı + mg | 51,52, m4, M2) 


( 251 o ( 259 ye 
sı +m, $2 + ma (10.98) 
Y Vee y yt ML 
s1+sg—k,m 1 15,251,252 $1,m1 $2,112 
5113284 
J1'73259 


As above, the sum in (10.98) can be simplified because each of the u terms 
in V gives the same result. The complex conjugation is irrelevant as usual . 
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Thus we can rewrite (10.98) as 


(s1 + $2 — k, m3 + mg | 81, 82, m1, M2) 


281 =1/2 282 Eum 
c Mo 
$1 +t mi $9 + m3 


DW "X ikJk . iki 7125, jr+1 73259 g 1177125, , ji233 
D € € Us1--53 —k,m1--ma Usi,mi Uss,ma 
Her (10.99) 


21777259 


281 -1/2 289 -1/2 
a deu a 
sı + mi $2 + m5 
Se y (*) ( 2s, — k Y 259 — k ) 
c= £ ss tm —k-£ 89 + M — £ 
where in the last line we have expanded all the es. 


To calculate N in (10.98), note that 
y 2191282 


2 
1/N = , > 81--52—k,m14-(m23,251,2$2 
ME (10.100) 


hj255 


Tk 1770254 Jk 1777252 pit tas 173283 
s1+s2—k,mi+me2 81 +82—k,mi+m2,251,252 


=p POVERI SIS 
Now, again, we can do the sum over 7; and jı by expanding V. The result 
will be proportional to V with two fewer indices 


25 ci yrs j1j255 =C yis j27j253 
81-s2—k,midma,251,252 — 81 52—k,mi-tm2,251 —1,252—1 


(10.101) 
because the sum over 7; and 7; doesn't affect the symmetry in the other in- 
dices. We can find the coefficient C' in (10.101) by picking out all contribu- 
tions to any single term in the expansion of the right-hand-side, for example, 


DII 


ioe... „ikje SR 0281 Jk+1 J282 
€222 .. . g'kJk M kesak anmam? (10.102) 
We will now look at all terms in V5, +s2—k,mı+m2,2s1,2sz that contribute 
to the particular term (10.102) in Vs 4.59~k,m1+m2,2s1—1,2s,—1 and find their 
contributions to the coefficient C. 
The term 


i1Ji 1272 ... ikjk ik+1 7 t2s1Jk+1 J282 
eTe EEU ee Sk MEET? (10.103) 


contributes 2. 
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Each of the 2s) — k terms of the form 


i1jm fi2j2 ikje Perl Io Jer Im 1 1 Jm-k1773253 
cime pet ncc (10.104) 


for m > k contributes 1. 
Each of the 2s, — k terms of the form 


imji ijz... ikje SEL Imo 131 Ím el 428; Jk+1 J282 
ge € sisak miima (10.105) 


for m > k contributes 1. 
Each of the k — 1 terms of the form 


iljm zimJ1,12j2 ,,, eim-1jm—1imtijmt1 .,. Ik theres, Jk 41773252 
s £ ig € € £ $152 — kymi - ma 
(10.106) 


for 2 < m < k contributes 1. 
Putting all this together gives C = (25; + 259 — k + 1) and thus 
Y eiiis jv7j2s2 


81 +s2—k,mi+me2 ,251,253 


iji te (10.107) 
= 12:125] J277259 
a (251 t 255 -k+ 1) Va tsak mda s2] 
Now we iterate the procedure, and do the sum over 7 and 7? in the same 
way, then 43 and j3 and so on. In general 
ip41jp41 yip 773251 Jp1773255 
2 £ Vs1+s2-k;mı+m2,251—p,2s2—p 
ip+1Ĵp+1 (10.108) 
=C pirtet2s1 Ip+2°"-J2s9 
— Yp "sics2—k,mi--m2,251 —p—1,255—p—1 
where Co = C and 


Ék-p17$2s, Jk 1773252 2o te1°'t281 Jkt1'J2sq 
Ma das ke mcis Ja: skies 7 “si tse—k,mitme (10.109) 
As we did before for Co, to calculate Cp, we can look at all contributions 
to a single term in the expansion of the right-hand-side, for example, 
ip+2jp+2 ... cikik 89170251 JR 17772 
c'»*2Jp* €kJk P mm (10.110) 
We will now look at all terms in V5, +5.—k.m1+m2,2s1—p,2s2—p that con- 
tribute to the particular term (10.110) in Vs, 45.—km+mo,2s1—p—1,2s2—p—1 
and find their contributions to the coefficient Cp. 
The term 


ip+1jp+1cipt2jp+2 ... cikik 4170251177323 
€ € EE tsrk bi (10.111) 
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contributes 2. 
Each of the 2s — k terms of the form 


ikp1 771254 Jk 1 jm-1 Iptl Jm41°"-J2s9 
5152—k,mi ma (10.112) 


cip 1m cip+2Ip+2 , . . gib) y 
for m > k contributes 1. 
Each of the 2s; — k terms of the form 


imjp+1 -ipt2dp+2 ... cikdk „ikti iml ip-E1im41:712s1 Jk 1773252 
€ € EDI smi eg (10.113) 


for m > k contributes 1. 
Each of the k — p — 1 terms of the form 


igiti25] Jii 253 


iptijm -imJp+1 cipt2ipt2 ... cim—1Jm-1gim+iimtl ... ctkdk 
e f 5 e € E Usi +s2—k,mitme 


(10.114) 
for 2 < m < k contributes 1. 
Thus 
C, = 28, 4-252 —k — pl (10.115) 
and 
x iji... cide tpg -t2syFk41°J28q 4 451770253 J1773252 
i: DL Us +s9—k,mi+me2 si+s2—k,m1+mz2,251,282 
j di. 
T AE fii dass |" 
k41770251 Jk41 77725 
= IT Cp » artes board dio ? 
p=0 ik 728, 
Jk-p1773253 
pi (2s; + 2s2— k +1)! ( 2s, + 2sq — 2k ) 
(251 + 259 — 2k + 1)! 81 + $9 +k + m4 +m 
(10.116) 
and 
2 2s2— k+ 1)! 2 289 — 2 
UN =p (251 + 232 +1) ( S1 + 482 ) (10.117) 
(2s, + 259 — 2k + 1)! 81-89 — lm mi 4 m; 


Now we have all the pieces, and can put them together into the Clebsch- 


164 CHAPTER 10. TENSOR METHODS 
Gordan coefficient for |s; + s2 — k, mı + mg) in |s1, m1) x |s2, m3), 


($1 + s2 — k, mi + mg | $1, 82, m1, M2) 


_ 25s; + 239 — 2k 25 289 1/2 
2i lo QE m x DNE 
(251)! (253)! V? / (254 + 282 — 2k + 1) 172 
( k! ) (mE) 


£ (F 25i —k 259 — k 
pes ) (^) aa Cee) 
m 2s; + 259 — 2k 281 232 -i2 
E (Meo Suet) (iu) " 23] 
281 239 1/2 2s, +289 —k+1 -1/2 
(ACA (CT) 
S (5( 2s, — k Y 2s — k ) 
pa £ sy +m —k+2é s2 +M — £ 
(10.118) 
This relation is a bit complicated for humans to use, but it is easy to code 


it into a computer program that computes arbitrary Clebsch-Gordan coeffi- 
cients, and it is nice example of the power of tensor methods. 


Problems 

10.A. Decompose the product of tensor components ut v/* where v/* = 
v*) transforms like a 6 of SU (3). 

10.B. Find the matrix elements (u|T; |v) where Ta are the SU (3) gener- 
ators and |u) and |v) are tensors in the adjoint representation of SU (3) with 
components u‘ and vt. Write the result in terms of the tensor components 


and the A, matrices of (7.4). 


10.C. In the 6 of SU(3), for each weight find the corresponding tensor 
component v^. 


*10.D. 
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a. Use the SU (2) tensor methods described in this chapter to redo prob- 
lem V.C. 


b. Check equation (10.86) by using the highest weight procedure to find 
the state |3/2, 1/2) in the tensor product of spin 3/2 states and spin 1 states 
and compare with the result from (10.86) for the Clebsch-Gordan coefficient 


(3/2, 1,3/2, 1/2 | 1,3/2,0,1/2) 


*10.E. Consider mP scattering, ignoring isospin breaking. There are 
three easily accessible processes (because it is hard to make 7? beams or 
neutron targets): 


ntp —+ ntp with amplitude A, = (z*p|Hr|r*p) 
"T pm p with amplitude A. = (v p|Hy|1~ p) (1) 


xp — n?n with amplitude Ao, = (n9n|H;|n- p) 


where Hy is the interaction Hamiltonian, which is approximately SU (3) in- 
variant. If we describe the pion and nucleon wave functions in terms of 
SU (2) tensors, 1^ = J for the pions and N5 for the nucleons, we can 
write the most general amplitude for this process as follows: 


(x N|Hy|x N) = Ay Tjk Nert" No + Ao Tj Non) N* (2) 


a. Use (2) and the techniques described in the notes on tensor analysis 
for SU (2) to write the three scattering amplitudes in (1) in terms of A; and 
A» and find the relation among the three amplitudes in (1). 


b. The scattering amplitudes in (1) can also be written in terms of 7 — 
3/2 and I = 1/2 amplitudes by decomposing the rN states into irreducible 
I = 3/2 and J = 1/2 representations and using Schur’s lemma. Find these 
amplitudes in terms of A; and Ao. 


Chapter 11 


Hypercharge and Strangeness 


Let's now go back to the 1950s to the discovery of strange particles. By this 
time, isospin was well established as an approximate symmetry of the strong 
interactions, broken by the weak and electromagnetic interactions. Experi- 
menters began to notice a strange new class of particles which were produced 
in pairs in scattering of strongly interacting particles, and decayed much more 
slowly back into ordinary particles. It was eventually realized that a lot of the 
physics could be understood if there were an additive quantum number, called 
strangeness that was conserved by the strong (and electromagnetic) interac- 
tions, but not by weak interactions. Call it S. Particles which carry a non-zero 
value of S are called strange particles. 


11.1 The eight-fold way 


The lightest strange particles are the K mesons, an isospin doublet with S = 
1 
K* with I; =1/2 


K?’ with B =—1/2 eh 
and their antiparticles with S = —1 
K” with I; =1/2 (112) 


K- with I; =—-1/2 


Like the 75, the Ks have spin zero and baryon number zero. 
There are also strange baryons, particles with baryon number +1. With 
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S = —1 there is an isotriplet (that is an isospin 1 representation) 


xt with J. 3=1 
X? with Iz =0 
3X ^ with J3 = —1 
and an isosinglet (isospin 0) 
A with Jz; =0 


With S = —2 there is an isodoublet (isospin 1/2) 


=° with h = 1/2 
S with Ia = —1/2 


All of these particles satisfy 


Q-T, Y/2 


167 


(11.3) 


(11.4) 


(11.5) 


(11.6) 


where Q is the electric charge in units of the proton charge, 73 is the third 
component of isospin, and Y is a quantum number called hypercharge, de- 


fined by 


Y=B+S 


(11.7) 


If we plot 73 versus Y for the baryons, we get a plot that suggests the 8 
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dimensional adjoint representation of SU (3): 


T 
Y. 
N P 
— 5- X0 A yt—— (11.8) 
T3 — 
ES =? 
To make the hexagon regular, we can choose 
H,=T3; Ho=Vv3Y/2 (11.9) 
T 
H» 
N P 
- 0 Po (11.10) 
2 3A » Hi 


Something similar works for the light spin 0 mesons if we add the some- 
what heavier 7 meson that was not discovered when SU(3) was first dis- 
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cussed. 
T 
Hə 
K? Kt 
EN 0 ER (11.11) 
d Tan i Hi — 
K- re 


It turns out that the SU (3) symmetry suggested by these plots is not a sym- 
metry of what were thought of as the strong interactions in the early 1960. 
But then Gell-Mann realized that you could imagine dividing the strong in- 
teractions into two parts: 


very strong interactions, invariant under and SU (3) symmetry 
under which the light baryons and mesons transform like 8s 
and; 
(11.12) 
medium strong interactions, which break SU (3), but conserve 
isospin generated by 71, Tz and 75, and hypercharge gen- 
erated by 2T3/V/3. 


Then he showed that SU(3) symmetry could be used in perturbation theory, 
like isospin, to understand a lot about the strong interactions. 


11.2 The Gell-Mann Okubo formula 


SU (3) is much more than just taxonomy. Consider the baryon masses. We 
can write the baryon states as 


Bi) (11.13) 


where Bi is a tensor (wave function) that labels the particular baryon we are 
interested in. For example, the proton state corresponds to 


BY = n js (11.14) 
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We can survey all the states at once by writing the following matrix: 


E9 QA + 
. V + Js Es P 
? = — A 
B; = x 5 4 Js N (11.15) 
z- z0 -2A 
E T V6 / ij 
Now 
Bij) = PIP) - NIN) +-+ (11.16) 


The entries of the matrix are convenient labels for keeping track of the states 
— P is the proton wave function, A is the A wave function, and so on. 

If SU (3) were an exact symmetry, all the particles of the octet would 
have the same mass. Because it is broken, the masses will be different. But 
if, as Gell-Mann assumed in (11.12), the SU (3) conserving very strong in- 
teractions are much more important than the SU (3) breaking medium strong 
interactions, the differences between different particle masses within the rep- 
resentations, the mass splittings, should be small compared to the average 
mass. Explicitly, the mass is the energy of a particle state at rest, so we want 
to calculate the matrix element of the Hamiltonian between the various single 
particle states. We will ignore the weak and electromagnetic interactions for 
now. The weak interactions give a negligible contribution to masses. The 
electromagnetic interactions are more interesting, because while their effect 
is small, it is significant because it violates isospin symmetry. We will come 
back to this later. Then the matrix element we want is 


(B|Hs|B) = (B|Hvs|B) + (B|Hus|B) (11.17) 


where Hys commutes with the SU (3) generators and Hms commutes with 
the isospin and hypercharge generators, but not with the others. The first 
term contributes a common mass to each particle in the SU (3) representation. 
In general, we cannot say anything very interesting about the second term. 
However, there is reason to believe that H ms transforms like the 8 component 
of an octet of SU(3). Actually, when Gell-Mann first did the calculation, 
he didn't have any reason to believe this. But it is the simplest possibility 
consistent with the isospin and hypercharge conservation assumed in (11.12). 
The point is that we know the strong interaction Hamiltonian commutes with 
the isospin and hypercharge generators. Thus it must be made up of tensor 
operators that have components with J = 0 and Y = 0. The 8 is the smallest 
nontrivial representation that has a state with J = Y = 0. The J = Y = 0 
state corresponds to the generator Tg. So Gell-Mann just assumed that Hms 
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transformed like the Tg component of an 8 in order to be able to use the 
Wigner-Eckart theorem to get a result. In particular, he assumed 


Hus = [Ts]; O] (11.18) 
where O is a tensor operator transforming like an 8 under SU (3), 
In. oj | = oO} [Ta] = OF [Ta]? (11.19) 


In (11.18) we are using the matrix element [Ts]; as a tensor component to 


pick out the component of the operator that commutes with J and Y. 
Now we can use tensor methods to compute the matrix element 


(B|Hus|B) (11.20) 


This is a calculation we have already done. The two independent SU (3) 
invariant combinations of the tensor coefficients are 


B; (Tal BF = Tr (Bt Ts B) 


i " i (11.21) 
B; Bj [Ts]! = Tr (Bt B T8) 
where we have used the fact that in matrix notation, 
B > B! = B*T (11.22) 


because B involves complex conjugation and interchanging of upper and 
lower indices, which is equivalent to transposition. 

Thus purely from SU (3) symmetry and the assumption that Hys is a 
component of an octet we know that 


(B|Hys|B) 
= X Tr (Bt By) +Y Tr (Bİ Ts B) 
= X (IB! Bl} + [Bt B$ - 2[Bt B3) / v2 (11.23) 


+Y ([B B']] + [B B} — 2[B BY8) / 12 
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So we need to compute the diagonal elements of B! B and B Bt 


2 
A m 2 
Bt Bju = | + =| o-[EX[-[E 
-50 A 2 
Bt Bj; = |Z? + —-—| [E 
[B' B]; = |Z*| vat V6 | | 
—2A |? 
[Bİ Blss = lp + In + |e 
, (11.24) 
[BBY = PEE. UE 
D>) A 
B Bl; = |x-[ + +s} nf 
[ J22 | |? E x In] 
2 
tee = Ia- + (20? + | 224 
[BP = Ie P + ef + | 
Thus 
(B|Hys|B) 
= X (Iz? + [EP — |AP - 2INP) /v12 aia 
+Y (IEP + INP — |AP — 22°) /vI2 


where we sum over particle types in each isospin representation. Now it is 
easy to find the contribution to each particle mass by picking out the right 
tensor coefficient. Adding the common mass Mo from the very strong inter- 


actions, we have 


My = Mo — 2X/V12 + Y/ V12 
My = Mo + X/V12 + Y/V12 


Ma = Mo — X/V12—Y/V12 


(11.26) 


Mz = Mo + X/V12 — 2Y / V12 


Thus we have expressed the four masses in terms of three parameters. We 
know nothing (from the symmetry) about the values of Mo, X, and Y, but 
we can eliminate these dynamical quantities and get one relation that follows 


purely from the symmetry — 


2(My + Mz) = 3M, + Mg 


(11.27) 
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This is the Gell-Mann-Okubo formula. And it works very well. For exam- 
ple, it implies 


Mss S (My + Mz) — Ms) (11.28) 
Putting in (in MeV) 
My = 940 My = 1190 Mz = 1320 (11.29) 
gives 
Ma = 1110 (exp. 1115) (11.30) 


compared to the experimental value of 1115, the difference is less than 1%. 
Considering that isospin breaking is bigger than this, it is much better than 
we have any right to expect. 


11.3 Hadron resonances 


Particles like the baryons and mesons that participate in the strong interac- 
tions are generically called hadrons. The baryons and mesons that we have 
discussed are the lightest hadrons. But there are also an enormous number of 
excited states of these light states that can be produced in particle collisions 
but decay back into the light states so quickly that they appear only as en- 
hancements in the scattering cross-section. The first hadron resonance to be 
discovered was the A, which shows up as a very large enhancement in the 
T P scattering cross-section at about 1230 MeV for angular momentum 3/2. 
The resonance appears in all the charge states, from +2 to —1, so this is a 
spin 3/2, isospin 3/2 state. It is part of a 10 of SU(3). All the other states in 
the 10 have now been observed. 


T 
Ho 
A~ A? At Att 
*— *0 *+ (11.31) 
p» M p» Hs 
hae a0 
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When Gell-Mann first discussed SU (3), the Q7 had not yet been observed. 
Gell-Mann was able to predict not only its existence, but also its mass. Let us 
repeat his calculation. 
Again, we assume that Hms transforms like the 8 components of an 
octet, and compute 
(B*|Hys|B*) (11.32) 


where we have called the decuplet wavefunction B* and 
|B*) = BY re) (11.33) 


is the general decuplet state. We have already done the tensor analysis, and 
in this case, we can immediately write down the result just by thinking. Since 
there is only one reduced matrix element, the matrix elements of all octet op- 
erators are proportional (component by component). Thus the matrix element 
we want is proportional to the matrix element of the generator, Tg, and thus to 
the hypercharge, Y . This means that we predict equal spacing for the isospin 
representations 


My- — MA = Mz. — My» = Mo- ~ Mz- (11.34) 
Experimentally, in MeV, 
Ma = 1230 My» = 1385 Mz» = 1530 (11.35) 


The spacings are nearly equal, and the average is about 150, thus we expect 
the QT at about 1680. Gell-Mann was even able to predict the fate of the 
Q7. With the predicted mass, it could not decay into two lighter particles 
conserving strangeness and baryon number. The lightest pair of particles with 
baryon number 1 and strangeness —3 is the K ^ and E? (or K? and =~) with 
a total mass of about 1815. Thus Gell-Mann predicted that the QT would 
look not like a resonance, but like a weakly decaying particle, decaying into 
states with strangeness —2, Ex and AK~. Sure enough, the (1^ was seen 
(first in bubble chamber photographs) with a mass of 1672 MeV. This was 
very convincing evidence that SU(3) is a good approximate symmetry of the 
strong interactions. 


11.4 Quarks 


Today, this may seem rather trivial, because we now know that all of these 
strongly interacting particles are built out of the three light quarks, u, d and 
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s, transforming like the 3 of SU (3). 


T 
Hə 
d u 
(11.36) 
Hi > 
E 
The baryons can be built of three quarks because 
3Q@3@3=100686@8el (11.37) 
To see this, note that 3 @ 3 = 6 @ 3, so 
38383 = (683) 6 (383) (11.38) 
and 6 & 3 looks like 
ud yk = T C v* T u v) + uj v!) 
3 (11.39) 


a : 
+3 (et eemn ud y^ + hey ul™ v^) 


which is 10 @ 8. Thus both the 10 and the 8 of baryons that we have al- 
ready seen can be built out of 3 quarks (the 1 also appears, in higher angular 
momentum states). 

The corresponding antiquarks transform like a 3 


D 
S 


(11.40) 


The mesons are built out of quark plus antiquark. Since 3 8 3 = 8 8 1, 
this is either an octet like the 7, K, ņ states we have already seen, or a singlet, 
like the 7/' (actually, because of the medium strong interactions, the 7 and n’ 
mix slightly). 
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The quarks have spin 1/2, and as we will see later, carry another property, 
color, that is essential for the understanding of the strong interactions. They 
also have baryon number 1/3 because three quarks are required to make a 
baryon. The u and d quarks have zero strangeness, and thus their hypercharge 
is 1/3. The s quark has strangeness —1, and Y — —2/3. For the quarks, the 
electric charge is 


Q-—T3-Y/2 


| 
eo C e 
Ne” 


Cae © 


(11.41) 


H 
KS 


Quarks were originally introduced as a mathematical device, a shorthand for 
doing SU(3) calculations. Today, though we know that quarks are real and 
we have a detailed understanding of many aspects of the strong interactions. 
Still, however, there is much that we cannot calculate. We are often forced to 
fall back on symmetry arguments. 

I can’t resist doing one more example of an SU (3) relation from the early 
days. The octet of spin 1/2 baryons have magnetic moments. Unlike the elec- 
tron magnetic moment, however, the baryon moments cannot be calculated 
just from their masses and quantum electrodynamics. They depend on the 
internal structure of the particles. But we can use SU(3) to say a lot about 
them. The crucial observation is that the operator that describes the mag- 
netic moment, whatever it is, must be proportional to Q, the electric charge 
of the quarks. It is therefore an SU(3) octet operator, and we can use the 
Wigner-Eckart theorem. We expect 


p(B) = aTr(B B! Q) + 8'TY(B! BQ) (11.42) 


Thus we expect 6 relations among the 8 magnetic moments (there is actually 
a 9th, because it is a transition magnetic moment that is responsible for the 
electromagnetic decay, £? — Ay). In fact, all the magnetic moments can be 
calculated in terms of (P) and (N). These predictions were first worked 
out by Sidney Coleman and Shelly Glashow in 1961. 
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Problems 


11.A. What would the Gell-Mann-Okubo argument tell you about the 
masses of particles transforming like a 6 of SU (3)? 


11.B. Compare the probability for A^ production in 39 P — At with 
the probability for X*? production in K- P — X:*9, assuming SU(3) sym- 
metry of the S-matrix. 


11.C. Use the SU(3) argument discussed in the chapter to repeat the 
calculation of Coleman and Glashow, predicting all the spin 1/2 baryon mag- 
netic moments in terms of u(P) and u(N). 


Chapter 12 


Young Tableaux 


We discussed Young tableaux in connection with the irreducible represen- 
tations of the symmetric groups. We will now see that they are useful for 
dealing with irreducible representations of Lie groups. We will begin by dis- 
cussing this for SU (3), but the real advantage is that it generalizes to SU (n). 


12.1 Raising the indices 


The crucial observation is that the 3 representation is an antisymmetric com- 
bination of two 3s, so we really do not need the second fundamental repre- 
sentation to construct higher representations. We can write an arbitrary repre- 
sentation as a tensor product of 3s with appropriate symmetry. In fact, as we 
have seen, Young tableaux correspond to irreducible representations of the 
permutation group, and the connection with the irreducible representations of 
SU(3) (and SU(N), as we will see later) is that the irreducible representa- 
tions of SU(3) transform irreducibly under permutation of the labels of the 
indices. 
Consider a general representation, (n,m). It is a tensor (in the old lan- 
guage) with components E 
A (12.1) 
separately symmetric in upper and lower indices, and traceless. We can raise 
all the lower indices with e tensors to get 


git in Em bn 


"m 12.2) 
= glikit yii gÀm Fm £m. qirin ( } 
Ji'"Jm 


Clearly, it is antisymmetric in each pair, k; ++ £j, and symmetric in the ex- 
change of pairs k;,£; € k;£;. 
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Now for each such tensor, we can associate a Young tableau: 


Ce femi] e [in] uea 
[a| [eml 


What we would like to do is to find a rule that associates with the Young 
tableau the specific symmetry of the tensor (12.2). We can do that by think- 
ing about the highest weight of the representation, (n, m). Because the low- 
ering operators preserve the symmetry, if we find the symmetry of the tensor 
components describing the highest weight, all the states will have that sym- 
metry. The highest weight is associated with the components in which all 
the is are 1, and all the k, £ pairs are 1,3. All of these can be obtained by 
antisymmetrizing the k, 2 pairs from the component in which all the ks are 
1, and all the Zs are 3. But this one component is symmetric under arbitrary 
permutations of the is and ks, and separately symmetric under permutations 
of the Zs. Thus we will obtain a tensor with the right symmetry if we start 
with an arbitrary tensor with n + 27m components, and first symmetrize all the 
is and ks, and separately the Zs, and then antisymmetrize in every k, £ pair. 
In the Young tableau language, this is very easy to state. We first symmetrize 
in the components in the rows, then antisymmetrize in the components in the 
columns. The result is symmetric in the is and in the k, £ pairs as (12.2) must 
be. But it also has a property that is the analog of tracelessness. Because we 
have raised the indices with es, the condition of tracelessness becomes 
6i, p, e aT tnim — g (12.4) 
This vanishes for a tensor with the symmetry properties just described be- 
cause of the symmetrization of the components in the rows. 
Thus a Young tableau like the one above is a rule for symmetrizing a 
tensor to project out a specific irreducible representation. For example, if 
ai1J2k1 is a general tensor with three upper indices, but no special symmetry 


property, the Young diagram 
(12.5) 
ki 


al1i2kı d gi? 


produces the tensor 


12.6 
Lg gh ( ) 


which transforms according to the (1,1) (or adjoint) representation. 
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We can generalize the concept to Young tableau with more rows. The 
general rule is the same. Put indices in the boxes. Symmetrize in the indices 
in the rows. Then antisymmetrize in the indices in the columns. 

In SU (3), the tensors corresponding to Young tableaux with more than 
three boxes in any column vanish because no tensor can be completely an- 
tisymmetric in four or more indices which take on only three values. Any 
column with three boxes corresponds to a factor of e in the three indices — 


(12.7) 


(12.8) 


describe the same representation as (12.3). 


12.2 Clebsch-Gordan decomposition 


We can now give, without proof, an algorithm for the Clebsch-Gordan decom- 
position of a tensor product. To decompose the tensor product of irreducible 
representations a and f corresponding to tableaux A and B, you build onto 
A using the boxes of B in the following way. Begin by putting as in the top 
row of B and bs in the second row. Take the boxes from the top row of B 
and put them on A, building to the right and/or down, to form legal tableau 
(that is collections of boxes in which the numbers of boxes in the rows are not 
increasing as you go down, and the numbers of boxes in the columns are not 
increasing as you go the right), with no two as in the same column. Then take 
the second row and add the boxes to each of the resulting tableau to form legal 
tableaux with one further condition. Reading along the rows from right to left 
from the top row down to the bottom row, the number of as must be greater 
than or equal to the number of bs. This avoids double counting. The tableaux 
produced by this construction correspond to the irreducible representations in 
a G B. 

Examples: 
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0em -oa ep (129) 


39893-26063 
pee nus (12.10) 
39832801 


It is less trivial to do this one the other way. 


Oe = [Ta] (12.11) 


The tableaux that are crossed out do not satisfy the constraint that the number 
of as is greater than or equal to the number of bs. Needless to say, it is easier 
to do it the other way, because you have fewer boxes to move around. 

Sometimes the tableaux that are produced in the first stage are useless, 
because there is no possible second state. Here’s an example: 


d'Bpu (12.12) 


Pu 4 RW 
"WomBetmguv 


Of course we would actually never do it this way. This is the decomposition 
of39 3 = 693. This is just the complex conjugate of 3 & 3 = 6 6 3 which 
we have already done, and which was much easier because it involved fewer 
boxes. That's a useful lesson. 
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Finally, here is a useful example which illustrates all of the rules — 8 @8 


& [ala] = (12.13) 
H- 
[ | Jala] > [T Tale?) e [ | [ofa] e [ [| fafa 
N ope [b] " 


LI Je] > ® |] la] |] Ta] 
p 
| | Jae] > [JT mbe || Je] e] I 
[| [7 B ap. 
n rá [a] 

SEL e[lle 

| [al | jal 

|a. [ajb] 


or 


e faja] = ala] e aa (12.14) 
HH 
10 


27 


e LL] e HH ® HRS? Ha 
| fale} Cial] | [bj | |a| 
n ja] [a | b 

10 8 8 1 


This analysis not only illustrates all the rules, it suggests one possible sys- 
tematic way of ordering the calculation, in this case by putting things as far 
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to the right as possible to start with, and then working to the left. It is impor- 
tant to have some such rule for your own calculations, so that you don't miss 
possibilities. 
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Young tableaux can do many other useful things for us. One that we can 
already explore in SU(3) is to help us understand how a representation of 
SU(3) decomposes into representations of its subgroups. An example of 
this which has some phenomenological importance is the decomposition of 
SU (3) representations into their representations of SU(2) x U(1), and par- 
ticularly, of isospin and hypercharge (Tg, remember, is just /3Y/2). 

The three components of the defining representation, the 3, decompose 
into a doublet with hypercharge 1/3 and a singlet with hypercharge —2/3. In 
general, consider a Young tableau with n boxes and look at the components 
in which 7 indices transform like doublets and n — j transform like singlets. 
The total hypercharge will be 


j. 2n-j) | 2n 
s En (12.15) 


We will denote the n —j singlet components by a Young tableau of n— 7 boxes 
in one row. This is the only valid Young tableau that we can build for these, 
because they all have the same index, and thus cannot appear in the same 
column. 


The SU(2) representations that these components transform under will 
be some subset of the representations that can be built out of 7 boxes. Now 
here is the point. To determine whether a given SU (2) representations, o, 
actually appears, what we do is to compute the tensor product of o with the 
n— j horizontal boxes. Then the number of times the original representation 
of SU (3) appears in the tensor product is the number of times œ appears in 
the decomposition. We will give a justification of this procedure later, when 
we generalize it. But now let us work out some examples. We will use a 
notation in which we refer to the SU(2) representation by their dimension, 
and put a subscript to indicate the hypercharge. Thus (2I + 1), stands for an 
isospin J representation with hypercharge y. 


First consider the 6. The result is illustrated in the following figure 


184 CHAPTER 12. YOUNG TABLEAUX 


(where the . represents the trivial tableau, with no boxes). 


pres (12.16) 
(m) » 
(OO) a 
( em) ee 


Each representation appears once. 
Now look at the 3. The answer here is obvious, because this is the com- 
plex conjugate of the 3. but let us see what it looks like in this language. 


A = (12.17) 


(B-) w 


(OO) 2 


Again, each representation appears once. Note that we cannot have a col- 
umn of boxes on the right hand side of the ordered pairs, because this would 
represent an antisymmetric combination of states with only one index, which 
vanishes. 

Next consider the adjoint representation, where we already know the an- 


SWeT. 
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Each representation appears once. 
Now let's do a less trivial one, for the 27 — 


HJ EN (12.19) 


Again, each representation appears once. 
Problems 


12.A. Find (2,1) & (2,1). Can you determine which representations 
appear antisymmetrically in the tensor product, and which appear symmetri- 
cally. 


12.B. Find 10 & 8. 


12.C. For any Lie group, the tensor product of the adjoint representation 
with any arbitrary nontrivial representation D must contain D (think about 
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the action of the generators on the states of D and see if you can figure out 
why this is so). In particular, you know that for any nontrivial SU (3) rep- 
resentation D, D & 8 must contain D. How can you see this using Young 


Tableaux? 


Chapter 13 


SU(N) 


We now want to generalize the discussion of the last few chapters to SU (N), 
the group of special unitary N x N matrices, generated by the hermitian, 
traceless, N x N matrices. 


13.1 Generalized Gell-Mann matrices 


There are a couple of different useful bases for the SU(N) generators. We 
will start with a generalization of the Gell-Mann matrices, in which we build 
up from the SU(N) to SU(N + 1) generators one step at a time. We will 
normalize (as in SU(2) and SU (3)) 


1 
Tr(TaTs) = 5 9a (13.1) 


The generators of the raising and lowering operators, we can take to have 
a single non-zero off-diagonal element, 1/ V2. The group is rank N — 1, 
because there are N — 1 independent traceless diagonal real matrices. We can 
choose the N — 1 Cartan generators as follows 


1 


eli = E) 


m 
> OikOjk — mima ism] (13.2) 
k=1 
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For example 
10 0 
1 0 
1 1 01 0 
His 9d th= TFs |0 0 -2 
(13.3) 
100 0 
1 010 0 
H42—|001 0 
v24|0 0 0 -3 
and so on. 


Altogether, there are N? — 1 independent traceless hermitian matrices. 
These N? — 1 matrices generate the N dimensional defining representation 
of SU(N), which we will sometimes call the N. The weights are N — 1 
dimensional vectors, 


; 1 
[V7 ]m = [H5]jj = Vm(m +1) e Ojk — ia) (13.4) 


(m 4- 1) 


These satisfy, 


"j = £4 2m(m41) p RUE 2 


m=1 k=1 
= jk +m hin) 
m= 2m(m + 1) V 
al : 
" 1 G- (13.5) 
2- mm 1) * 2j - 1) 
-ix(- 1 ee 
2 mcm (m 4 1) 2j 
|o Goh Wo 


Xj- 
N 
NE 
bo 
2 
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and for i < j, 

y! yi 
N-1 

i = imme) (Zu pon 
m=1 
m 

x (x jk - mdma) (13.6) 
k=1 

c e a. 
2 mm+1) Zj 1) 
1 


The weights all have the same length, and the angles between any two distinct 
weights are equal — 


i2 _ md in d. 
V TM d mier for iz j (13.7) 
or j 
v.v = “ay + 5 bi (13.8) 


Thus the weights form a regular figure in N — 1 dimensional space, the N — 1 
simplex. 
Explicitly, the weights look like 


pall 1 1 1 
2'2/3  'y2m(m-1)  '2(N—-l1)N 

3 1 1 1 1 

V =|- Tm , ? , 

2’ 2/3 2m(m +1) 2(N —1)N 

ps = , : , , : 
V3 2m(m + 1) 2(N — DN 

(13.9) 

y^ = [0,0,.-., L8 : 

2m(m + 1) 2(N —1)N 
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For convenience, we will choose a backwards convention for positivity 
—a positive weight is one in which the LAST non-zero component is positive. 
With this definition, the weights satisfy 


1 


ps qu ssa Sy (13.10) 


The Hs take us from one weight to another, so the roots are differences 
of weights, v* — v? for i # 7. The positive roots are v* — v? for i < 7. The 
simple roots are 


oi = rt — rit! fori=1toN-—1 (13.11) 


The roots all have length 1. They satisfy using (13.8) 


at af = 5 (big — ag — Sigs + Siegen) 
E i E i5 (13.12) 


= 6j — 50 jii 


so the Dynkin diagram is 


O-O-O-O aa» 


The fundamental weights are 
j 
p - V (13.14) 
k=1 
It is easy to check using (13.7) that these satisfy 


2" 
Rd E ds (13.15) 
a 


because the term proportional to sw in (13.8) cancels because the simple root 
is a difference of fundamental weights. Thus jj! is the highest weight of the 
defining representation. 


13.2 SU(N) tensors 
As in SU(3), we can associate states with tensors, 


ll 2 |.) (13.16) 
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then we can build up arbitrary representations as tensor products. 
Consider, for example, the antisymmetric combination of m defining rep- 
resentations. The states are 


Alii, --- im) (13.17) 


where AU: is completely antisymmetric. This set of states forms an irre- 
ducible representation, because of the antisymmetry. Because of the antisym- 
metry, no two indices can take on the same value. Thus the highest weight in 
this set of states arises when one of the indices is 1, another is 2, and so on. 
That means that the highest weight is the fundamental weight 1" 


m 
yo aye (13.18) 


Thus this representation is the fundamental representation D™. 
The highest weight of any irreducible representation can be written in 
terms of the Dynkin coefficients, q^, as 


p=} quu (13.19) 
k 


The q*s are non-negative integers. The tensor associated with this represen- 
tation has, for each k from 1 to N — 1, qx sets of k indices that are antisym- 
metric within each set. A simple generalization of the argument as for SU (3) 
shows that symmetry of this tensor can be obtained from the following Young 
tableau, with q* columns of k boxes 


(13.20) 


N-1 
boxes 


This gives a tensor of the right form for the same reason as it did in 
SU (3). The highest weight state has a term in which the top row is all 1s, the 
second is all 2s, the third is all 3s, etc. Thus the tensor we want is obtained 
by first symmetrizing in the indices in the rows, and then antisymmetrizing in 
the indices in the columns. This is exactly the symmetrization condition that 
we used at the end of chapter I to construct the irreducible representations of 
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the permutation groups. Thus the irreducible representations of SU(N) with 
m indices are associated with the irreducible representations of Sm. 

As in SU(3), tableaux with more than N boxes in any column corre- 
spond to tensors which vanish identically. Any columns with N boxes con- 
tribute a factor of the completely antisymmetric tensor with N indices (which 
we will again call e). Tableaux which are the same except for columns with 
N boxes correspond to the same irreducible representation. 

We will sometimes denote the representation corresponding to a Young 
tableau by giving the number of boxes in each column of the tableau, a series 
of non-increasing integers, (£1, £2, - - -]. In this notation, D? is [;]. 

For example the adjoint representation, in SU(N) as in SU(3), corre- 
sponds to a tensor with one upper and one lower index. To get it into the 
standard form of a tensor, we must raise the lower index with an N compo- 
nent e. Thus the adjoint is [N — 1, 1]. 

Clebsch-Gordan decomposition works the same way as for SU(3), ex- 
cept that now we need cs for the third row of the tableau, ds for the fourth, 
etc. Along with the condition that reading along the rows from right to left 
from the top row down to the bottom row, the number of as must be greater 
than or equal to the number of bs which is greater than or equal to the number 


of cs, etc. 
E ® J (13.21) 


For example, 
[2,1] e [3] 


gs 


[2] & [1] 


or 


(13.22) 


: 


[4] & [4] 


Notice that it is the rule about number of as being greater than or equal to 
the number of bs and so on, that prevents us from having more than one of 
any of these representations. Of course, if N is less that eight, some of the 
tableaux in the last example will give vanishing tensors, while others will 


Seg 


aJo Ioja 


[4,4] e [53] e [62] e [7.1] e [8] 
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have columns of N boxes that can be eliminated without changing the repre- 
sentation. 

Note, in general, in the Clebsch-Gordan decomposition process that the 
only time boxes can disappear is when columns of N boxes are removed. 
Thus each tableaux in the decomposition of a tableau with j boxes @ a tableau 
with k boxes will have a number of boxes equal to 7 4-k modulo N. This is the 
analog of the conservation of triality in SU (3). The quantity j +k mod N 
in SU(N) is sometimes denoted by the linguistic barbarism /V-ality. 


13.3 Dimensions 


For most of the small representations that we will need, it is easy to work out 
the dimension. For example, the dimension of [7] is 


(7) = LUN (13.23) 


The dimensions of the two-column representations can be worked out using 
the Clebsch-Gordan series. For example, from [1] & [2] = [2, 1] © [3], we can 
find the dimension of [2,1]. 


es : (*) k (3) Z Mv +N) (13.24) 


There is, however, a simple rule for obtaining the dimensions of any 
representation from its tableau. It is called the factors over hooks rule. It 
is a special case of Weyl’s character formula. It works as follows. Put an N 
in the upper left hand corner of the tableau. Then put factors in all the other 
boxes, by adding 1 each time you move to the right, and subtracting 1 each 
time you move down. The product of all these factors is F. A hook is a line 
passing vertically up through the bottom of some column of boxes, making a 
right hand turn in some box and passing out through the row of boxes. There 
is one hook for each box. Call the number of boxes the hook passes through 
h. Then if H is the product of the hs for all hooks (the same factor we used 
in (1.164), the dimension of the representation is 


F/H (13.25) 
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For example, for [2,1], the factors are 


(13.26) 
F2 N(N 4-1(N - 1) 


The hooks are 


E Hf (13.27) 


pae LAM (13.28) 


so H — 3. Thus 


in agreement with (13.24). 


13.4 Complex representations 


Most of the representations of SU(N) are complex. For example, the lowest 
weight of the defining representation is vN. But we know because the Cartan 
generators are traceless that 


N . 
DEI (13.29) 
j=l 
and thus 
N-1 
v =~ y) = =u! (13.30) 
j=1 


Thus [1] is complex, and its complex conjugate is DN-! or[N — 1], 


[I] 2 [N — 1] (13.31) 
Simlarly, the lowest weight of [m] is the sum of the m smallest v's 
yose en (13.32) 
j=N—m+1 j=l 


and thus 
[m] = [N — m] (13.33) 
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In general, the complex conjugate of a representation is 
[5,4] = [N — £44, N — 41] (13.34) 


Thus the tableau corresponding to a representation and its complex conjugate 
fit together into a rectangle N boxes high, as shown below: 


N boxes 


(13.35) 


13.5 SU(N)& SU(M) € SU(N 4 M) 


Now we can generalize the discussion of SU(2) x U(1) € SU(3) from the 
last chapter. Consider the SU (N)& SU(M)GU (1) subgroup of SU(N +M) 
in which the SU(N) acts on the first N indices and the SU (M) acts on the 
last M indices. Both of these subgroups commute with a U(1) which we can 
take to be M on the first N indices and —N on the last M (note that it is 
traceless). 


(l=(O mel. Dx (13.36) 


We would like to determine how an arbitrary irreducible representation 
of SU(N + M) decomposes into irreducible representations of SU(N) & 
SU(M) & U(1). Suppose that we have an SU(N + M) representation with 
n +m boxes. To compute the number of times that the product of an SU(N) 
representation A with n boxes and a SU (M) representation B with m boxes 
appears in the SU(N + M) representation when we restrict n indices to be 
from 1 to N and m to be from N +1 to N+ M, we relax that condition and let 
all the indices run over all N + M values, and find the tensor product A & B 
of the two representations (as representations of SU(N + M)). Then the 
number of times that the pair (A, B) appears in the decomposition is equal to 
the number of times the original SU(N + M) representation appears in the 
tensor product A ® B. Note also that the U (1) charge of the representation is 
nM —mN. 

Notice that if we take M — 1, the SU(M) group disappears, because 
there is no algebra SU (1) — it has no generators. However, the construction 
described above still works. This is what we used in the previous chapter 
to do the decomposition of SU (3) representations under the SU(2) x U(1) 
subgroup in (12.18) and (12.19). 
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Let us do the example SU(3) & SU(2) & U(1) € SU(5), where the 
defining representation is 


[] (13.37) 
= ET] * Jo (3, 1)2 


e ( ° O )-3 (1,2)-3 
Then the adjoint representation, which we showed above was [N — 1,1], is 


(13.38) 


-(H a), s» 
e (F CI), Q3» 
(BB. om 
«(EE B), e» 
«(H EP), &»- 


Problems 


13.A. Show that the SU (n) algebra has an SU (n — 1) subalgebra. How 
do the fundamental representations of SU (n) decompose into SU (n — 1) 
representations? 


13.B. Find [3]@[1] in SU (5). Check that the dimensions work out. 
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13.C. Find [3,1]@[2.1] 


13.D. Under the subalgebra SU(N) & SU(M) ® U(1) € SU(N+M), 
where the defining representation N+M transforms as ([1], [0]) m @((0], [1] -N 
(in the [£4, - - -] notation where the £s are the number of boxes in the columns 
of the Young tableaux, and where the first bracket indicates the SU(N) rep- 
resentation and the second the SU(M)), how do the fundamental representa- 
tions of SU (N+M) transform? How about the adjoint representation? 


13.E. Find [2] & [1, 1] in SU(N), and use the factors over hooks rule to 
check that the dimensions work out for arbitary N. 


Chapter 14 


3-D Harmonic Oscillator 


This is an important chapter, but not because the three dimensional harmonic 
oscillator is a particularly important physical system. It is, however, a beau- 
tiful illustration of how SU(N) symmetries arise in quantum mechanics. 


14.1 Raising and lowering operators 


The Hamiltonian (here as elsewhere we have set A = 1) is 
2 (14.1) 


where 


1 ; 
c a (142) 
t : 
2 TU Ty — 6 
ay Jamu ( k Dr) 


for k = 1 to 3. The a, and al are lowering and raising operators satisfying 
the commutation relations of annihilation and creation operators for bosons, 


lax, al = One 


[ax, ag] = Piet =0 
Q 
Q 
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(14.3) 
akak, al] = = al 


T aj = - ag 
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If the ground state is |0), satisfying 


a,,|0) = 0 (14.4) 
then the energy eigenstates are 
a}, --- af. |0) (14.5) 
with energy 
w (n + 3/2) (14.6) 


The degeneracy of these states is interesting — it is the number of symmetric 
combinations of the n indices, kı - - - kn, which is 


(n + 1)(n 4+ 2) 
2 


Just the dimension of the (n, 0) representation of SU(3). This suggests that 
the model has an SU (3) symmetry. 

It is not hard to find the SU (3) explicitly, because the raising and lower- 
ing operators are completely analogous mathematically to creation and anni- 
hilation operators for bosons. And we know how to form SU(3) generators 
out of creation and annihilation operators. This guess turns out to be right. 
The generators of the SU (3) symmetry on the Hilbert space are 


Qa = ah [Ta]ke ae (14.8) 


where Ta = A,/2 are the Gell-Mann matrices. You have already shown that 
these satisfy the commutation relations of SU(3) in problem (5.B), and you 
can check that 


(14.7) 


[Qa H] =0 (14.9) 
so that the energy eigenstates form representations of SU (3). Also, 
Qal0) = 0 (14.10) 


so that the ground state is an SU (3) singlet. 
Just like the creation and annihilation operators, the raising and lowering 
operators form tensor operators under the SU (3), 


Qa. aj] = a! [Talex (14.11) 
thus al transforms like a 3. The lowering operator, ag, transforms like a 3, 


(Qa, ax] = —[Ta]ke ae 


= ~ag [TT |e, = —ae [Tr le 


(14.12) 
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Because all the states in the Hilbert space can be obtained by the action of the 
raising operators on |0}, the Hilbert space decomposes into (n, 0) representa- 
tions. No other representations appear. 


14.2 Angular momentum 


Something interesting has happened here. The SU (3) symmetry is nontrivial. 
It implies that states with different angular momenta are exactly degenerate. 
Angular momentum is a subgroup of SU (3) generated by (see problem 14.C) 


L3 =2Q2, I1=2Q7, L= -2Qs5 (14.13) 


But for n > 1, the degenerate states in level n consist of more than one 
anglar momentum. For example, the n = 2 states transform like a 6 of 
the SU (3), which under angular momentum transforms like 5+1. Thus at 
this level, the symmetry guarantees a degeneracy that does not follow from 
rotation invariance alone. But notice that we didn't have to do anything to 
impose this SU (3) symmetry. We just imposed rotation invariance, and then 
the SU (3) symmetry popped out, because of the linearity of the Harmonic 
oscillator force law. Nonlinear, anharmonic effects would spoil the SU(3) 
symmetry. For example, a term in the potential like (z?)? would conserve 
angular momentum, but break the SU (3). 

This example can be extended to SU(N) if you let the number of di- 
mensions be larger or smaller. The dimensions, in this case, just give you 
more indices for your raising and lowering operator. For the harmonic os- 
cillator, this looks pretty silly, since we live in three dimensions. But in a 
relativistic theory when we describe the Hamiltonian in terms of creation and 
annihilation operators, it is perfectly natural. The first approximation to the 
Hamiltonian of any system of N different types of particles with the same 
mass will be proportional to the total number operator, ala, where the sum 
over k runs from 1 to N, just because the energy of the m particle state will 
be rn times the energy of a one particle state. This has an SU(N) symme- 
try. Of course, interactions and other effects that treat the particles differently 
may break the symmetry. 


14.3 A more complicated example 


It is very instructive to consider a slightly more complicated model in which 
all representations of SU (3) appear in the Hilbert space. Consider the fol- 


14.3. A MORE COMPLICATED EXAMPLE 201 


lowing two-particle Hamiltonian: 


R.gy (14.14) 


where the Ž and P are an independent set of coordinates and momenta. This 
describes two harmonic oscillators, in units with mjw, = Mw = 1, cou- 
pled in what appears to be a complicated way. But in fact, it is carefully 
constructed to be simple in terms of raising and lowering operators. Defining 


d!-(r-ip)v2, | @ = (F + ip)/V2, 


io. nts Set PENNE (14.15) 
bt —(R—iP)JV2, b=(R+iP)/V2 
then the Hamiltonian becomes 
H = (ala, + 3/2) + wo (blby + 3/2 
1(azay + 3/2) + wo (bib + 3/2) du 


4-44 (alb!) (aebe) 


The point of this is the form of the interaction term proportional to A, and 
in particular, the way the indices of the raising and lowering operators are 
contracted. It is constructed to commute with the operators (see problem 
14.D) 


Qa = af [Ta]ke ae — b} [Tă]ke be (14.17) 


These have the commutation relations of SU (3), so the theory has an SU (3) 
symmetry with the at transforming like a 3, as in the simple three dimensional 
harmonic oscillator, but with the b! transforming as a 3. Because we now have 
raising operators that transform like 3s as well as 3s, we can construct states 
which transform like arbitrary SU (3) representations. 

To see what the states look like, note that H also commutes with the 
number operators, 


N,-—ala, and N= blby (14.18) 


202 CHAPTER 14. 3-D HARMONIC OSCILLATOR 


Thus the energy eigenstates have definite numbers of ats and bts, and trans- 
form under irreducible representations of SU (3). They look like 


|(n, m), K) 


= (al, ab b Zr — traces) (14.19) 


an K 
(at ot) |0) 
transforming like the (n, m) of SU (3). For example, the octet states look like 


(«lol = AG : i) (at. 5t)“ 10) (14.20) 


The indices of the ats act like lower SU (3) indices while the indices of the 
bts act like upper indices. The irreducible representations must be traceless 
in a pair of an at index and a b! index. 

It is straightforward to calculate the energy eigenvalues. Because of the 
SU (3) symmetry, we can look at any state in an irreducible representation, 
and in particular, at the highest weight states 


((n, m), K) = (al) C (at. P)" jo) (14.21) 


The first two terms in H just count ats and bts. The nontrivial term is the 
interaction term proportional to A. But because we know that the state is an 
eigenstate, we can just do it. 


= (at 8) [a & (aD"] ql)" (at - 8) 1o (a) 
(at - Bt) (al)" [2-8 0h] (at 8) 19 (b) 
+E 8) d" OD less [oss (at 8^] |i (c) 


(14.22) 
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(b) works the same way, and gives 
m K |(n,m), K) (14.23) 
(c) is 


(at B) (ap 64) [oss |o (a? 8)" ]| 10 
= (at Bt) (at^ (91) [oss K al (at. NE m 


= K (d! - bt) (a])" (63) (a GE pre (14.24) 


gu lae (at. in 
= K (K +2)|(n,m), K) 
Thus the energy eigenvalue is 
w] (ne Ke 5) + we (m+K+5) +4A K (n 4- m + K +2) (14.25) 


The physics of this model is not very interesting. However, as an ex- 
ample of how symmetries arise, it is very instructive. There is a very useful 
way of thinking about what goes on here. For a moment, think of the inter- 
action term, proportional to A, as a perturbation on the harmonic oscillator 
terms. In the absence of the A term, the theory has a larger symmetry — 
SU (3) x SU (3) (and some U (1)s, but we will concentrate on the non-Abelian 
symmetries) — separate SU (3) symmetries on the a! and b! variables. The 
interaction terms couples the as and bs together and breaks the independent 
SU (3) symmetries down the single SU (3) that treats the interaction terms 
as a singlet, because a,b; and alb] are singlets under the SU (3) symmetry. 
It is often useful to organize the symmetry structure of the theory in such a 
structured way. 


Problems 
14.4. Show that the operators 
1 
ok. = alata, aa on + PEN 
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transform like a tensor operator in the (2,1) representation. 


14.B. Calculate the non-zero matrix elements of the operator O$, (where 
Ok is defined in 14.A) between states of the form 


aj (atot) 10) 


and 
alal (atot) [0) . 


14.C. Show that (14.13) generates the standard angular momentum, 


Txp. 


14.D. Show that 
[Qa akbk] = 0. 


Chapter 15 


SU (6) and the Quark Model 


In the spectrum of low-lying baryons, the octet with spin 1/2 and the decuplet 
with spin 3/2 are not very far apart in mass. Typical splittings between the 
octet and decuplet states are not so different from splittings within the SU (3) 
representations. Noticing this, in the 60's, many physicists played with the 
idea of embedding SU (3) in some larger symmetry group that would connect 
the two representations. Because the baryons in the 8 and 10 have different 
spins, the larger group cannot commute with angular momentum. You might 
expect this to cause problems, because it means mixing up internal symme- 
tries and spacetime symmetries, and indeed it does. But the problems do not 
show up until you try to make the theory relativistic. We will discuss some of 
this, and possible resolutions, later. 


15.1 Including the spin 


The obvious way to extend the SU (3) symmetry group is to include in the 
quark states the quark spin as well as the quark type. This suggests that we 
think about a 6 dimensional tensor product space in which the states have an 
SU (3) index, and a spin index — 


(Iu), 14), 1s) ([1/2),1-1/2) a5.) 


The corresponding symmetry group is an SU (6) symmetry that acts on this 
six dimensional tensor product space. We can write the generators as products 
of 3 x 3 matrices in the SU (3) space (though not necessarily traceless) and 
2 x 2 matrices in spin space (again, not necessarily traceless). In particular, 
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the generators include the SU (3) generators 

DAL (15.2) 
with the identity in spin space understood, and the spin generators 

595 (15.3) 


with the identity in the SU(3) space understood. That is, this SU(6) has an 
SU (3) subalgebra under which the 6 transforms like two 3s, and it has an 
SU (2) subalgebra under which the 6 transforms like 3 2s. The SU(3) and 
SU (2) algebras commute with one another. We say that the SU(6) has an 
SU(3) x SU (2) subgroup under which the 6 transforms like a (3,2). The 
other SU (6) generators are the products 


Daag (15.4) 
The total number of generators is then 8+3+24=35, which, sure enough, is 
the number of independent, hermitian, traceless 6 x 6 matrices. 

The low-lying baryons, consisting of three quarks, transform like the 56 
dimensional representation of SU (6), the completely symmetric combination 
of 3 6s. Let's pause to understand how such a representation transforms. 


15.2 SU(N) x SU(M) € SU(NM) 


The general situation is SU(N) x SU(M) € SU(N M). It arises only for 
SU (k) where k is not a prime, and thus this embedding does not show up in 
SU (2) or SU(3). The idea is to always exploit the idea of a tensor product 
space. For SU(N M), the defining representation has NM indices, and we 
can therefore describe it in a tensor product space, replacing the N M indices 
by an ordered pair of indices, 2, z where 7 runs from 1 to N and x runs from 
] to M. Then the matrices 


[ML e cando “Sap us (15.5) 


are hermitian and traceless (remember that in a tensor product space, taking 
the trace means contracting both types of indices) and generate SU(N) and 
SU (M ), respectively. 

Under the SU(N) x SU(M) subalgebra generated by these matrices, 
the NM transforms like (N, M) (or equivalently, ([1],[1]) — that is there 


152. SU(N) x SU(M) € SU(NM) 207 


are M copies of the N of SU(N) which transform into one another under 
the SU(M) — or vice versa, whichever you like). The reason that this is 
useful is that very often our creation operators have more than one index — 
like the SU (3) indices and spin indices in the quark states of SU (6). Then 
this kind of tensor product is the natural place for them, and such SU(N M) 
symmetries arise quite naturally. 

Now we can ask the decomposition question. Given a representation of 
SU(N M), how does it transform under SU(N) x SU(M)? 

In particular, suppose we have a representation D, of SU(N M) corre- 
sponding to some Young tableau with K boxes. In a tensor language, D is a 
tensor with K SU(N) indices and K SU(M) indices. Thus because of N- 
ality and M-ality, if the decomposition contains the representation (Dj, D2) 
where D, is an SU(N) representation with K, boxes and D» is an SU(M) 
representation with K boxes, then 


K= K; mod N= K mod M. (15.6) 


To go further, it is again easiest to argue backwards. First, add columns of 
N boxes to D; and columns of M boxes to D» if necessary to bring both 
up to a total of K boxes. Now what the decomposition means is that we can 
write the tensor described by a D tableau with (i, z) pairs in each box as a 
linear combination of products of tensors with indices in the D, tableau and 
x indices in the D» tableau. 

Now the idea is simple. Both the Dı and D» tableaux have definite sym- 
metry properties under permutations of the k indices. They are associated 
with irreducible representations of the symmetric group Sp. The question 
then is whether the product has any component that had the symmetry of D. 
If so, then (Di, D2) will appear in the decomposition of D. 

Of course, this doesn’t quite solve the problem, because we must still 
understand whether the tensor product of two irreducible representations of 
Sx contains a third. We won't try to solve this problem in general. But we 
will be able to tell in some special cases. 

For the 56 of SU (6), this analysis is simple. The Young tableau associ- 
ated with the 56 is 


BE (15.7) 


which corresponds to the trivial representation of S3. There are three possible 
irreducible representations of SU (3) with three boxes: 


ECL E- H (15.8) 
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And there are two possible irreducible representations of SU(2) with three 
boxes: 


amm HJ (15.9) 


The SU(3) x SU (2) representations in the 56 are then those ordered pairs 
of SU (3) and SU (2) representations for which the trivial representation of 
$3 appears in the tensor product of the 53 representations of their Young 
tableaux. For example, the representation 


(o-oo) (15.10) 


appears, because the completely symmetric representation is contained in its 
tensor product with itself. This corresponds to the 10 of SU (3) with spin 3/2. 


The other possibility is 
( FJ 3 FJ ) (15.11) 


The tensor product of these two 53 representations contains the trivial repre- 
sentation (as we will see explicitly) because any pair of SU (3) and SU (2) 
indices transforms the same way under any permutation, the product contains 
the representation which is unchanged by the permutation. This represen- 
tation corresponds to the 8 of SU(3) with spin 1/2. Thus the 56 includes 
precisely the low-lying baryon states. 


15.3 The baryon states 


Let's look explicitly at these states. The decuplet states are particularly simple 
because they are separately symmetric in the SU (3) and SU (2) indices. For 
example, indicating the +1/2 spin states by |+) 


[A**,3/2) = [uuu) |^) 
JAP 1/2y = s (luud) + judu) + |duu)) 


(be) + |4-+) + |-++)) 

V2 

6 

-(uds) + |dus) + |sud) + |sdu) + |dsu) + |usd)) 


(15.12) 
|5*9, 1/2) = 


(12) + |+-+) + |-++)) 
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The octet states are more complicated because they are antisymmetric in one 
pair of spins and quark labels (because of the columns in the Young tableau). 
To get a completely symmetric state, you can multiply the two and then add 
cyclic permutations. For example 

V3 


d (|uds) ~ Idus) ) (+-+) — |-++)) 
+(|sud) — |sdu) ) (I — m) 
+(|dsu) — [usd) ) (I--9 — e) 


(15.13) 


In this notation, the quark label and spin states are correlated. The first quark 
label goes with the first spin states, and so on. The way this A state works 
is that if you interchange both spin and quark labels of any pair, one of these 
three terms is unchanged, while the other two get interchanged. Note the nor- 
malization of the state. It is often more convenient to construct these states 
by multiplying an SU (3) state that is symmetric in some pair of indices, like 
|wud) which is symmetric in the first two, by a spin state which is symmetric 
in the same pair, but with total spin 1/2. Then again, adding cyclic permuta- 
tions gives a totally symmetric state. For example, 


|P, 1/2) -2 


diea (21) ~— |+-+) - |-+4)) 
(15.14) 
+|udu) (2|+-+) - |-++) — |[++-)) 


+(|duu) (2|-+4) — |++-) — e) 


In the first term, for example, the |+—+) state and the |-++) state must have 
the same coefficient for symmetry under interchange of the first two quarks, 
but the |++-—) state is automatically symmetric, so it can have a different co- 
efficient. We have chosen the coefficients so that the sum vanishes, which 
guarantees that the state will be orthogonal to the symmetric spin 3/2 state. 
This can never give a decuplet state. If the SU (3) state is completely symmet- 
ric, the sum over cyclic permutations gives zero rather than a nontrivial state. 
Again, note that if you interchange both spin and quark labels of any pair, one 
of these three terms is unchanged, while the other two get interchanged. 
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15.4 Magnetic moments 


Let us now see what SU (6) symmetry has to say about the magnetic mo- 
ments of the baryons. To determine the SU (6) properties of the quarks, note 
that if a quark is a point particle like the electron, its magnetic moment is 
approximately 


$ ga (15.15) 
2m 


where m is the quark mass and eQ is the quark charge. This looks like an 
SU (6) generator, so we infer that the magnetic moment operator transforms 
like the 35 dimensional adjoint representation. 

We are interested in the matrix elements 


(56|35|56) (15.16) 


€ 
EE 
id 


Because there is a unique 56 in the decomposition, we know that the matrix 
elements are determined by a single reduced matrix element. Therefore, we 
can compute the magnetic moments up to an overall constant by looking at the 
matrix element of any operator that transforms like the correct component of a 
35. In particular, we can use the SU (6) generators themselves, and conclude 


But 35 & 56 is 


use x (56| Q d |56) (15.18) 


We can use this relation to compute the ratio of the proton to neutron magnetic 
moments, which was not determined by the Coleman-Glashow analysis. If 
we compute the matrix element between states with the same o3 value, then 
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only the Qc matrix element is not zero, so we can compute that. We do this 
just by applying the tensor-product rule: 


Q o3 |P.1/2) > 


Dee (219) — |+-+) + |-++)) 
+5luud) (21) + |+-+) - |-++)) (15.19) 
~ = |uud) (-2|++-) — |+-+) - |-+4)) 


+cyclic permutations 


Thus 


P,1/2|Qo3|P, 1/2 

( d [2) ; ; Gn 
=3—-(-(44+1-1)+-(4-141)-5(-44+1+]))}=1 

Jj (tree cd ples ) 

The matrix element between neutron states can be obtained by simply inter- 

changing the us and the ds in this calculation, which means interchanging the 

2/3 and —1/3 factors — 


(N,1/2|Qo3|N, 1/2) 


2 1 2 
= 3--(--(4+1-1)-— =(4-14+1)+ 3(-44+1+1)) =--> 
oO Cane Cis) PIC EbEDe as 
(15.21) 
So we expect 
3 
ease (15.22) 
LN 2 
Experimentally, in nuclear magnetons (units of e/2mp) 
pp = 2.79 py = —1.91 
(15.23) 
BE SCA 
BN 


So it works, but the logic is rather indirect. From the quark model, we ex- 
tracted only the SU(6) transformation law of the magnetic moment operator. 
We then used the Wigner-Eckart theorem to show that we can calculate the 
ratio by looking at the corresponding matrix elements of an SU (6) generator. 
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We can get the result more easily by simply assuming that the quarks 
actually exist as nonrelativistic constituents, and that the baryons really are 
bound states of three quarks in an angular momentum zero state. Then the 
magnetic moment of the baryon is simply the sum of the magnetic moments 
of the quarks. The magnetic moment of a quark (by analogy with an electron) 
is ge in the direction of the spin where m is the quark mass, thus it can be 
written 


L gg (15.24) 
2m 


Therefore in the tensor product language, the sum of the quark magnetic mo- 
ments is d 
$5 D Që (15.25) 
quarks 


The proton magnetic moment is Eni up where mp is the proton mass and 


Lup is the magnetic moment measured in "nuclear magnetons". Thus 


e e 
———up = —(P|Qo|P)/(Ple|P) = — 15.26 
Face = gg (PIQAIP)/(PIAIP) = 5— (1529 
which is exactly what we computed in the SU(6) argument, but with one 
additional bit of information — we know the scale. If we assume that the 
quark mass is about 1/3 the proton mass, we have 
ips beg (15.27) 
m 
This is not bad! 

The quark model is thus not only simpler, it is more predictive. It also 
gives a reasonable account of some SU (3) breaking effect in the baryon mag- 
netic moments. For example, the Coleman-Glashow prediction for the A 
magnetic moment is 


HA -3ur = —.93 (15.28) 


You can see this directly in SU (6) by computing 


1 
(A, 1/2|Qos3lA, 1/2) = E (15.29) 
But experimentally 
LA = —.61 (15.30) 
which is significantly different. The quark model gives 
e e 
—pr = c (15.31) 
2m, pe », 2Mquark Ses 
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In the matrix element, only the s quark actually contributes in the A, (15.13), 
because the u and d spins are combined into a spin zero state. Thus we predict 


WA Ee ee (15.32) 


In the quark model, we expect the s quark to be heavier than the u and d 
quarks to account for the larger mass of hadrons containing the s quark. For 
example, if we assume (very roughly) m4 z m; + 2m and mp % 3m, then 


gba = mp... Agha (15.33) 


SO 
ua = —.64 (15.34) 


which is a bit better. 
Problems 


15.A. Find the SU (6) (i.e. quark model) wave functions for all the spin 
1/2 baryons except P, N and A (which were discussed in the text). 


15.B. Use the wave functions you found in (15.A) to calculate the mag- 
netic moments, 


a. in the SU (6) limit, calculating the ratios to yp; 
b. in the quark model, put in SU (3) symmetry breaking by including 
ms F ™My,d- 


15.C. Show that the |A, 1/2) state, (15.13), is an isospin singlet. 


Chapter 16 


Color 


There are some things wrong with the simple quark model discussed in the 
previous chapter. The first is that the connection between spin and statistics is 
wrong for these quarks. The quarks must have spin 1/2 in order to produce the 
spin 1/2 and 3/2 baryons. Thus we would expect them to obey Fermi-Dirac 
statistics, and we would expect the ground state of the three quark system to 
be an s-wave, completely symmetric under the exchange of the position labels 
of the three quarks. This would lead to an SU (6) representation completely 
antisymmetric in the SU (6) indices of the three quarks — which is a [3] or a 
20 which transforms under SU (3) x SU(2) as ([2, 1], [2, 1]) ® ([3], [1, 1, 1]) 
or (8,2) @ (1, 4) which is not what we want. 

The second difficulty is that we need some explanation of why only the 
Gq and qqq combinations seem to exist. This would certainly not be explained 
by any simple attractive force between quarks. 

Finally, there is now a tremendous amount of evidence from studies of 
high energy QCD that the quarks carry an attribute which we call color. 
Quarks come in three colors, and the colors interact with a set of 8 gluons 
in an SU(3) symmetric way. The SU (3) associated with the color interac- 
tion is called color SU(3). It has nothing to do with Gell-Mann's SU(3), 
except, of course, that it provides the force that binds the light quarks into 
Gell-Mann's representations. 


16.1 Colored quarks 


The quark states transform like a 3 — they have a single color SU (3) index 
which I will let run from 1 to 3. The quark wave functions are tensors 


d fori=1to3. (16.1) 
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They also carry labels for Gell-Mann's SU (3) (flavor), spin and position. The 
idea is that the color interaction binds three quarks into a color singlet baryon 
state, which is then described by contracting the color wave functions of the 
quarks with an e — 


eijk d^ q) q* (16.2) 


Then, because the e is completely antisymmetric, the baryon state is symmet- 
ric under the exchange of all the other labels. 


It is this binding of three quarks into an SU (3) singlet baryon that sug- 
gested the name “color” for the color SU (3) indices. The metaphor is based 
on the fact that colorless light can be produced by combining beams of the 
three primary colors, red, green and blue. In the same way, a colorless (that 
is color SU (3) singlet) baryon state can be made out of three colored quarks. 
Thus the color SU(3) indices are sometimes referred to as red, green and 
blue, rather than 1, 2 and 3. 


I think that it was the use of the term color that gave rise, in turn, to the 
name flavor for the label, u, d, s, etc, that distinguishes one type of quark 
from another. Gell-Mann's SU (3) is sometimes referred to as flavor SU (3) 
because it transforms the light flavors into one another. Flavor is a completely 
independent attribute. The quarks carry both color and flavor (and also spin). 


Antiquarks transform like 3s under color SU (3), because color charges, 
at least those in the Cartan subgroup that can be diagonalized, are quantum 
numbers like electric charge and baryon number, and like electric charge and 
baryon number, they change sign in going from a particle to its antiparticle. 
If the Cartan charges change sign, that means that the weights change sign, 
and that means going from the 3 representation to its complex conjugate, the 
3. 


We can make a color singlet state from three antiquarks by contracting 
with an € — 


e Tiq Gr (16.3) 


This describes an antibaryon. Or we can make color singlet states out of one 
quark and one antiquark — 


d' d; (16.4) 


These are the mesons. They include the pseudoscalar meson octet, the pseu- 
doscalar singlet 7’ and also an octet plus singlet of spin 1 vector mesons, 
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shown below 


K*?? K** 


(16.5) 


Kt x 


Why does the color force bind these states? A partial answer is that it 
behaves like the electromagnetic force. The gluons couple to color charges, 
that is the color SU(3) generators, just as the photon, the particle of light, 
couples to electric charge (the generator of a U(1) symmetry). The electro- 
magnetic force between two objects is attractive when the product of their 
electric charges is negative. This is why the electromagnetic force tends to 
bind charged particles into neutral atoms and molecules. One difference with 
color is that there are 8 gluons, instead of one photon, and the color interac- 
tion between two colored particles is proportional to the sum of the products 
of their color charges. If this is negative, the force is attractive. 

Specifically, consider a state of two particles, A and B, transforming 
according to some representations of color SU (3): 


TA|r, A) = |s, A) [TA] sr 
a r, A) = |s, A) [Za] (166) 


T7 |x, B) = |y, B) [Tz ]yz 


where r, s (x,y) are the color indices for the A (B) representation. The two 
particle state is then a tensor product 


|v, A, B) = vrz|r, A)|z, B) (16.7) 


16.1. COLORED QUARKS 217 


Now the color interaction is proportional to the product of the charges, 
TA TP (16.8) 


summed over a, in the usual sense of the tensor product space. This is invari- 
ant under SU(3) because the generator on the tensor product space is 


T, = TÅ TP (16.9) 
which commutes with (16.8) because 
Ir? + TP, TA TP 
= [r4 TA] r£ e T [r2 m? (16.10) 
=i fare (TA TË + Tf TË) =0 


The eigenstates of (16.8) are irreducible representations of color SU (3) 
because of Schur’s lemma. To compute the eigenvalues, it is convenient to 
write 1 ; 

TATE = 0; - TA? _ TP’) (16.11) 


The object T? (summed over a) is called a Casimir operator (it is the analog 
of J? in angular momentum SU(2)). It is easy to see that it commutes with 
the SU (3) generators and is therefore a number on each irreducible represen- 


tation. Note that TA and TP ? are fixed because they are properties of the 
particles involved, but T2 depends on how the A and B states are combined 
into a state with definite color — that is, it depends on how the particular lin- 
ear combination of tensor product states we are looking at transforms under 
SU (3), and it has definite values on states that transform under an irreducible 
representation. Loosely speaking, T2 measures the size of the color repre- 
sentation. The smaller T2, the less “colorful” the state. Thus the color force 
is most attractive in the least colorful states, and therefore it tends to bind 
quarks and antiquarks into the least colorful siates possible. This is a step in 
the right direction. 

In a qq state, the most attractive state will be an SU(3) singlet (with 
T? = 0). In a three quark state, we can consider each pair of quarks in turn. 
If the colors of a quark pair are combined symmetrically, the qq state is a 6. 
If they are combined antisymmetrically, the state is a 3. 72 is 4/3 for the 3 
and 10/3 for the 6, thus the antisymmetric combination is the most attractive. 
Thus the most favored state is one that is antisymmetric in the colors of each 
quark pair, which is the color singlet, completely antisymmetric state. 
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16.2 Quantum Chromodynamics 


There is much more to color. The quantum theory of the color interaction 
of quarks and gluons is called QCD, Quantum Chromodynamics, in anal- 
ogy with the quantum theory of the electromagnetic interactions of electrons 
and photons, QED, Quantum Electrodynamics. There is a dimensional 
parameter built into the QCD theory, Aqcp, a few hundred MeV. The QCD 
interaction is rather weak for distances smaller than 1/ Aqcp, however, it gets 
strong for distances larger than 1/Aqcp. We now believe that the quarks are 
permanently confined by the strong long-distance QCD interactions inside 
colorless hadrons, so that we can never completely isolate a colored quark 
from its hadronic surrounding. This doesn't mean that we cannot see quarks. 
In fact, we see quarks and gluons rather directly in scattering experiments at 
energies and momenta much larger than Agcp. But it does mean that quarks 
do not show up directly in the low energy spectrum. 

With this picture of the strong interactions, we can understand why Gell- 
Mann's SU (3) is a useful symmetry. The interaction of the gluons is the same 
for each of the quarks. The only thing that distinguishes between quarks (in 
the strong interactions) is their mass. A mass term in the QC D Hamiltonian 
looks like 

m4 üu-4- mqdd 4 m;35s (16.12) 


where u, d, s (u, d, 3) are annihilation (creation) operators for the u, d and s 
quarks. This can be rewritten 


(m, + mq + ms) (uu *- dd -55)/3 (a) 
(m, — mg) (uu — dd)/2 (b) 
+(2m, — m, — Ma) (23s — uu — dd)/6 (c) 


(a) is an SU(3) invariant. (b) breaks isospin symmetry, which is one rea- 
son that we think that the u-d mass difference is very small, a few MeV. 
This term is a small perturbation that you can ignore unless you are inter- 
ested in isospin breaking. It happens to have effects roughly the same size 
as the electromagnetic interactions, even though it probably has nothing to 
do with electromagnetism. (c) is the term that is responsible for most of 
the breaking of Gell-Mann's SU (3). It is fairly small, because the s — u and 
s — d quark mass differences, as measured, for example by the mass splittings 
within SU (3) representations, are not large compared to Agcp. Notice that 
(c) is a tensor operator, the 8 component of an octet. This is why Gell-Mann's 
original guess about the transformation law of the medium strong term in the 
Hamiltonian actually worked. We no longer have to guess because we have 
some understanding of the dynamics. 
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16.3 Heavy quarks 


As you probably know, there are other quarks besides the u, d and s quarks. 
The c (for charm) quark, the b (for bottom or beauty) quark and the £ (for 
top or truth) quark have all been seen in high energy collisions. They are 
all unstable, decaying back into the lighter quarks very quickly. But the c 
and b quarks last long enough to bind into hadrons just as the lighter quarks 
do. So there is now a rich phenomenology of particles containing c and b 
quarks. These particle had not been seen, and were barely even imagined 
when Gell-Mann first explored the consequences of SU (3) symmetry. 

How do these states transform under Gell-Mann's SU(3)? You should 
be able to guess what these states look like. There are meson states and 
baryon states. Meson states with a heavy b or c antiquark and a light u, d or 
s quark transform like the 3, because the light quarks are a 3. Meson states 
with a heavy b or c quark and a light u, d or s antiquark transform like the 
3, because the light antiquarks are a 3. Meson states in which both the quark 
and the antiquark are heavy are SU(3) singlets. They transform trivially 
under Gell-Mann’s SU (3) because the heavy quarks do not carry any SU (3) 
properties at all. The baryons containing a single heavy quark and two light 
quarks transform like the 6 and the 3 of SU (3). And so on. 


16.4 Flavor SU(4) is useless! 


Every so often, someone gets the deceptively attractive idea of enlarging Gell- 
Mann's flavor SU (3) symmetry to an SU (4) or an SU (5) including the c or 
the c and b quark. Alas, this seemingly obvious extension is quite useless. 
The trouble is that the masses of the c and b quarks are so much larger than 
the light quark masses, and so different from each other, that mass differ- 
ences involving these quarks are much larger than Aocp. The perturbative 
description of the breaking of SU (3) that makes Gell-Mann's SU (3) useful 
is not appropriate in the extensions to SU(4) or SU(5). Don't be fooled. 


Problems 


16.A. Finda relation between the sum of the products of the color charges 
in the qq state in a meson and a the qq pair in a baryon. 


16.B. Suppose that a “quix”, Q, a particle transforming like a 6 under 
color SU (3) exists. What kinds of bound states would you expect with one 
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quix and additional quarks or antiquarks? How do these states transform 
under Gell-Mann's SU(3)? Hints: The quix is a flavor singlet, because 
Gell-Mann's SU (3) just transforms the light quarks. Also, you should only 
include states whose wave functions cannot be factored into two independent 
color singlets. For example, thinking just about the light quarks, you would 
not include a qqqqq state, because you can show that every wave function you 
can write down factors into a color singlet qq and a color singlet gqq. The 
corresponding state would presumably fall apart into a meson and a baryon. 


16.C. Here is a convenient way to calculate the Casimir operators, C(D) 
(— T2) for small representations, D. Note that 


Tr T = dim(D) C(D) = V Tr(TaTa) = Y, kp = 8kp 


where kp is defined in (6.2) and dim(D) is the dimension of the representa- 
tion D. kp — 1/2 for D — 3 (or3) so C(D) — 4/3. But kp behaves ina 
simple way under © and G: 


(a) kp,eDs = kp, + kp, 


(b) kp,@p, = dim(Di) kp, + dim(D2) kp, 


Prove (a) and (b) and use them to calculate C(8), C(10), and C(6). 


Chapter 17 


Constituent Quarks 


The color SU(3) theory of hadrons is a genuinely strongly interacting the- 
ory. There is no obvious small parameter that we can use to express QCD 
predictions in a perturbation series. At present, first principles QCD calcula- 
tions can be done only by large-scale computer evaluations of the functional 
integral. While such calculations have yielded important evidence that QCD 
can explain the spectrum of light hadrons, the analysis is far from complete. 
Given this complexity, it is remarkable that a simple non-relativistic quark- 
model gives a reasonable qualitative picture of light hadron masses. In this 
section, we will go over this picture briefly. 


17.1 The nonrelativistic limit 


If all the quarks had masses very large compared to the QCD parameter, A, 
we could justify the nonrelativistic quark model in a simple way. In that 
case, the QCD interaction would behave rather like ordinary electrodynamics, 
except for a few peculiarities associated with the non-Abelian behavior of the 
Gluons, as we discussed in the previous chapter. The baryons would simply 
be nonrelativistic bound states. We would be able to organize a calculation 
of the bound-state energies in an expansion in inverse powers of the large 
quark masses. The leading contribution would be simply the sum of the quark 
masses. In next order, there would be a contribution from the color interaction 
depending on the space wave function but independent of the quark spins. In 
the limit that the u, d and s are degenerate, we would have an approximate 
SU(6) symmetry like that discussed in Chapter 15. The leading SU (6)- 
breaking spin dependent interactions would be suppressed by inverse powers 
of the masses, because they are all relativistic effects of color magnetism. The 
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most important such effect for the ground state (which should be primarily an 
£ = 0 state) would be the color magnetic moment interactions between the 
constituent quark spins. 

This picture works surprisingly well even though the quarks are light 
(though it must be supplemented to describe the pseudoscalar mesons — not 
a surprise, because the pion is too light to be thought of as a nonrelativistic 
bound state). We do not understand why it works as well as it does, but let's 
see how it gives a picture of the masses of the ground states in the baryon 
and meson sectors, the S'U (6) 56 of baryons (the spin 1/2 octet and spin 3/2 
decuplet) and the quark-antiquark bound states in the 6 & 6 (the spin 1 and O 
octets and singlets). 

What is really interesting about this picture is the way the spin-dependent 
interactions work. If the ground-states are primarily s-wave, there will be 
no contribution from a single color-magnetic moment interaction. However, 
there will be important contributions from gluon exchange between quarks 
that depend on the relative orientation of the color dipoles. Except for the 
color factor, this interaction looks like that between electron magnetic mo- 
ments, or between current loops. If two current loops are sitting on top of one 
another, the state in which the dipole moments are aligned has lower energy 
than the state in which they are paired (antiparallel). 

In both the baryon and the meson, the color force between pairs is at- 
tractive. Thus the magnetic moment interaction is like the magnetic moment 
interaction between an electron and a positron, with opposite charges (as ex- 
plained in Chapter 16, it is the magic of color that allows each of the three 
pairs in the proton to behave as if they have “opposite” charges). Therefore 
aligned magnetic moments correspond to paired spins, and vice versa. Thus 
we expect that the state in which the spins are paired has lower energy than 
the state in which they are aligned. Formally, there is a term in the Hamilto- 
nian that looks like 


f aö; (17.1) 


where the o's are the Pauli matrices acting on the spins and « is some func- 
tion of the quark positions. The factors of 1/m; are there because the color 
magnetic moment is inversely proportional to the quark mass. To the extent 
that the wave functions are independent of the type of quark (which they are 
in the SU(3) limit) and quark spin (the limit we are considering), « is just a 
constant on the whole 56 or 35. 

The spin-spin interaction, (17.1), makes the spin 1 mesons, in which the 
spins are aligned, heavier than the spin O mesons in which they are paired. 
It also makes the baryon decuplet, in which every pair is aligned, heavier 
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than the octet, in which some spins are paired. In particular, if we ignore the 
mass differences between quarks it clearly just depends on the total spin, S, 
because we can write it as 


= 53 (x 8i 8;- Y. a. J (17.2) 


where n = 3 for baryons and 2 for mesons. 

The real beauty of this picture is that in the masses of the ground state 
particles, we see the quark mass dependence of the color magnetic interac- 
tion. For example, the splitting between =* and E is smaller than the A-N 
splitting because it involves the heavier s quarks. Likewise in the meson sys- 
tem, the K*-K splitting is smaller than the p-7 splitting for the same reason. 
The most amusing example along these lines is the X-A splitting. These two 
baryons consist of the same quarks, one u, one d and one s, and they have the 
same total spin. The difference is the way in which the spins are put together. 
In the X, because the isospin is 1, the u and d quarks are in a symmetric fla- 
vor state, and thus the spins must be aligned, because the state must also be 
symmetric in spin space. In the A, the isospin is O, the u and d quarks are in 
an antisymmetric flavor state, and thus the spins are paired. The A has lower 
energy because the color magnetic interaction between the lighter u and d 
quarks is more important. 

Furthermore, the ratio of ms to Mm, g required is about the same in the 
mesons and the baryons. You will show this in problem 17.B. 

Although this simple picture was developed in the early days of QCD, 
it is still not known for certain whether its success is simply fortuitous, or 
whether it is telling us something important about QCD, or both! Probably, 
both. Certainly, there are reasons to believe that the picture is more compli- 
cated, because there are other ways of estimating the quark mass ratios that 
give very different results. But it is also true that the constituent quark model 
picture described in this chapter goes over very smoothly to a sensible de- 
scription of mesons and baryons containing heavy c and b quarks. Thus even 
though, as explained in the last chapter, we cannot use more powerful sym- 
metry arguments to understand these states because the symmetries are badly 
broken, we can still understand a lot about them by using the quark model 
directly. The simultaneous success of the constituent quark model for both 
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heavy and light quark states convinces me that it does capture at least some 
important part of the physics of QCD bound states. 


Problems 


17.A. Suppose the quix, Q, described in (16.B) is a heavy spin zero 
particle. Then in the ground state qqQ bound states of an antiquix and two 
light quarks (any of u, d or s), the only spin dependence should come from 
the light quarks (because the ground state presumably has orbital angular 
momentum zero). Discuss the spectrum of all the qqQ ground state particles, 
spin O and spin 1, giving their SU(3) properties and their spins. Hint: In 
the ground state, we expect the space wave function of the two quarks to be 
symmetric under exchange. 


17.B. Estimate m, 4/ rn, by comparing the p — m mass splitting with 
the K* — K mass splitting. Make an independent estimate of the ratio using 
appropriate combinations of the Z*-X and Z-A. mass differences. 


*17.C. Discuss semiquantitatively the mass spectrum of baryon states 
with a single c quark and a pair of light quarks (various combinations of u, d 
and s). You will need to find some experimental information about some of 
these bound states to get started. You will also need to think about which of 
the formulas in this chapter you can trust in this context. 


Chapter 18 


Unified Theories and SU (5) 


The forces of the standard model of Elementary Particle Physics are shown 
in the table below: 


“Known” Forces 


Ces [ ERM [weak | sone | 
range | oe |10 em 108m | 
LIN NECEM 
panicle [photon | Wand Z | guons | 
ms | 0 [= i00mp ooz] | 9 _| 


Gravity is separate, because if we were only interested in the physics of indi- 
vidual particles, we wouldn't know about it at all. It is only because we have 
some experience with huge collections of particles put together into planets 
and stars that we know about gravity. 


18.1 Grand unification 


Each of the other forces is associated with a Lie Algebra. This suggests that 
it may be possible to unify all the particle interactions as different aspects of 
a single underlying interaction, based on a single simple Lie algebra. 

We will see in this chapter that all the particle interactions fit very neatly 
into the simple Lie algebra SU (5). We will also discuss other embeddings, 
based on larger algebras. These theories are called Grand Unified theories, 
where the term “grand” is added for obscure historical reasons. We will not 
be able to discuss the full structure of these theories without the language of 
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quantum field theory. But we can, at least, exhibit the Lie algebraic struc- 
ture of grand unified theories in some detail. I hope that this will whet your 
appetite for a more complete study of the physics behind the group theory.! 

We have already discussed the color SU (3) theory of the strong interac- 
tions. The SU (2) x U(1) theory of the electroweak interactions is slightly 
more complicated. We will give only a superficial introduction here. 


18.2 Parity violation, helicity and handedness 


One of the salient features of the weak interactions is that they violate par- 
ity. Parity is the symmetry in which the signs of all space coordinates are 
changed. It is equivalent (up to a rotation) to reflection in a mirror. Spin 1/2 
particles like electrons and quarks, if they are moving, can be characterized 
by their helicity, the component of the spin in the direction of motion. For a 
spin 1/2 particle, the helicity is +1/2. Particles with helicity 1/2 are said to be 
right-handed. Those with helicity —1/2 are said to be left-handed. For rea- 
sons that are not obvious, but which follow from basic principles of quantum 
field theory, the antiparticle of a right-handed particle is left-handed, and the 
antiparticle of a left-handed particle is right-handed. Helicity (or handedness) 
is not invariant under a parity transformation because a mirror interchanges 
left and right. Thus if some interaction acts differently on the right-handed 
and left-handed components of a particle, the interaction is parity violating. 
That is what the weak interactions do. A massive particle (at least if it carries 
a conserved particle number) must have both left-handed and right-handed 
components, because the helicity of a massive particle is not relativistically 
invariant. It changes sign depending on the reference frame. Thus electrons 
and their heavier cousins, muons and taus, and all the quarks have both left- 
and right-handed parts. But a massless particle need not have both compo- 
nents. Neutrinos are known to be very light, and for our purposes, we can 
treat them as massless? Thus far, only left-handed neutrinos and their an- 
tiparticles, right-handed antineutrinos, have been observed. 

For reasons which will become clear, it is useful to describe the symmetry 
properties of the creations and annihilation operators, rather than those of the 
states. We will restrict ourselves to the interactions of the lightest particle, 
the u and d quarks and the electron and its neutrino. The heavier particles all 
seem to be copies of one of these. 


!See H. Georgi and S. L. Glashow, Phys. Rev. Lett. 32 (1974) 438. 
There are a number of experimental indications of tiny neutrino masses. Note also that a 
neutrino could have a small mass that violates particle number. 
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The Glashow-Weinberg-Salam theory of the weak and electromagnetic 
interactions treats the creation and annihilation operators as tensor operators 
under an SU(2) x U(1) Lie algebra. We will call the SU(2) generators 
R, and the U(1) generator S. Consider the creation operators for the right- 
handed particles: 


ul, at, et, at, d, et, vi, (18.1) 


which create, respectively, right-handed u quarks, d quarks, electrons, u an- 
tiquarks, d antiquarks, positrons, and antineutrinos. The color index of the 
quark and antiquark operators is suppressed for now because it plays no role 
in the weak interactions. We will come back to it later. Color SU (3) com- 
mutes with the electroweak SU(2) x U(1). Under the SU(2) algebra, the 
positron and the antineutrino form a doublet, transforming according to the 
spin 1/2 representation. Like the proton and neutron creation operators under 
isospin, these two creation operators can be regarded as the 1 and 2 compo- 
nents of a tensor operator: 


Z =a, vt. (18.2) 


Likewise, the d and t antiquarks transform like a doublet under SU (2) 
and we can write them as components of a tensor operator: 


visa, =a. (18.3) 


The 4} field is also a tensor operator under color SU (3), transforming like a 
3, but for now, we have not written the SU (3) index explicitly. 

Then the commutation relations of these creation operators with the SU (2) 
and U (1) generators are the following: 


[Raut] =0, [s, ut] = 2ul/3; 

[Raat] =0, [s,at] = —dt/3; 

[Raet] =0, [s,et] = -et ; (18.4) 
[R91] = Bs (0%]er/2, [9,5] = - 6; 

[5,2] 225 191,42... [S5 | 1a. 


Thus all the fields are tensor operators, with y! and 7! transforming like 
doublets under the SU (2), and the rest transforming like singlets. 
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The annihilation operators for the right-handed particles are just the ad- 
joints of (18.1). They transform under the complex conjugate representa- 
tion. In particular, all the S values change sign. The creation operators for 
the left-handed particles transform like the annihilation operators for their 
right-handed antiparticles. Thus, for example, the creation operator for a left- 
handed u quark transforms like the annihilation operator from a right-handed 
u antiquark. 

The S values in (18.4) have been constructed so that the electric charge 
operator, Q, is 

Q=R +85. (18.5) 


You can check that 
[Q ui] =2ut/3,  [Q,at] =-at/s, 


ze CEU - dg. (18.6) 


[Q, e! EUM [Q.z*] =0. 


Now the idea of the electroweak standard model is that as in QCD, each 
of the generators of the Lie algebra is associated with a force particle. The 
Ra with three W,s, and the S with X. One linear combination of W3 and 
X is the photon which couples to the electric charge, Q. Thus electromag- 
netism is contained within this larger theory. The other particles, the W+ 
(corresponding to the complex combinations W; + :W3) and the Z (the com- 
bination of Ws and X orthogonal to the photon) are responsible for the weak 
interactions. 


18.3 Spontaneously broken symmetry 


There is something peculiar going on here. The SU (2) x U(1) cannot really 
be a symmetry. If it were, the weak interactions would have long range, like 
the electromagnetic interactions. Instead, the weak interactions have very 
short range and their force particles, the W and Z, are massive. Furthermore, 
if the SU (2) x U(1) symmetry remained unbroken, the quarks and the elec- 
tron would have to be massless particles, because the weak interactions treat 
their left-handed and right-handed helicity components differently, which is 
consistent with relativity only for massless particles. Some new physics gives 
mass to the quarks and leptons, and to all but one linear combination (the pho- 
ton) of R3 and X, without destroying the consistency of the theory. What is 
this physics? 


18.4. PHYSICS OF SPONTANEOUS SYMMETRY BREAKING 229 


The answer is the structure of the vacuum state. The SU (2) x U (1) gen- 
erators commute with the Hamiltonian, but the vacuum state of the world is 
not an SU(2) x U(1) singlet. Hence, SU (2) x U(1) is not a good symme- 
try on the states of the physical Hilbert space, all of which are built on the 
asymmetric vacuum state. The quarks and leptons and the W+ and Z are not 
degenerate with the massless photon in our vacuum. 

This situation is called spontaneous symmetry breaking. The SU (2) x 
U (1) symmetry is said to be spontaneously broken down to the U(1) of elec- 
tromagnetism, because only the linear combination Q = R3 + S treats the 
vacuum state of our world as a singlet. The resulting theory gives a very good 
description of the weak and electromagnetic interactions. 


18.4 Physics of spontaneous symmetry breaking 


It would be logical to explain, at this point, what new physics it is that pro- 
duces spontaneous symmetry breaking. Alas, we still do not know what this 
new physics is. There is, however, a model of the process that is easy to ex- 
plain and worth understanding. This is the hypothesis of the Higgs field. A 
field is a quantity defined at each point in space and time, like the electric and 
magnetic fields of electromagnetism. The electric and magnetic fields have 
several components that transform nontrivially under rotations and Lorentz 
transformations. But imagine that there exists a scalar field, which is in- 
variant under rotations and Lorentz transformations. Such a field can have 
a non-zero value in the vacuum state without breaking rotation symmetry or 
Lorentz invariance. If the field transforms nontrivially under SU (2) x U(1), 
a non-zero vacuum value breaks the SU (2) x U (1) symmetry spontaneously. 

In the SU(2) x U(1) electroweak theory, a Higgs field transforming like 
a doublet of SU (2) and with S = 1/2 does the trick. Such a field (call it ¢) 
transforms as 


[Rapt] = e! 6/2, [S44] = 44/2. (18.7) 


If such a field exists, its self interactions can be described by a potential, V (4) 
which is just the energy stored in a constant ¢ field. V(ó) is actually only a 
function of $! $ because it is invariant under SU (2) x U(1). Now the lowest 
energy state corresponds to the minimum value of V(ó). But it may be that 
¢ is not zero at the minimum of V (à). A simple example of a potential with 
a non-zero value of ¢ at its minimum is 


V($) =a (so v?) (18.8) 
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for A > 0. This is minimized when ¢'¢ = v?. Then, for example, we can 
take the vacuum value of ¢ to be 


$1 —0, $2 =v. (18.9) 


Notice that (18.7) implies that the combination R3 + S acting on the vacuum 
value of ¢, (18.9), gives zero, so that this particular subgroup of SU(2) x 
U(1), associated with electromagnetism, is not broken by the Higgs field. 
But any other linear combination of SU(2) x U(1) generators acting on the 
vacuum value of ¢ gives a non-zero result, which means that these generators 
are spontaneously broken — they correspond to rotations from the physical 
vacuum to an unphysical vacuum state. The precise form of the unbroken 
combination depends on the particular choice, (18.9), for the vacuum value 
of ¢. However, any other choice with ¢'¢ = v? gives the same physics 
because it is related to (18.9) by an SU(2) x U(1) transformation. Thus it 
does no harm to make the choice (18.9) which leads to a simple form for the 
unbroken U(1) generator, Q = R3 + S. 

The Higgs field also allows the electron and quarks to get mass. The 
rule is that a Higgs field can produce a mass for a spin 1/2 particle if the 
tensor product of the representation of the right-handed particle and the rep- 
resentation of the corresponding antiparticle contains the representation of 
the Higgs field or its complex conjugate.) You can check that this is the case 
for the electron and quark fields. 


18.5 Isthe Higgs real? 


There is overwhelming evidence that the SU(3) x SU (2) x U(1) model of 
the strong and electroweak interactions, with the SU(2) x U(1) symmetry 
spontaneously broken down to the U (1) of electromagnetism, is an excellent 
description of the interactions of elementary particles down to distances of 
the order of 107 !5cm. It is important to point out, however, that we do not 
yet know whether the Higgs field actually exists. It is quite possible that it 
is simply a mathematical metaphor for some other dynamics with the same 
symmetry properties. As of this writing, we do not know whether the Higgs 
is physics or mathematics. But we do know that if the Higgs is not real, there 
is some other new physics that does the symmetry breaking. One of the most 
important goals for particle physicists today is to either see the Higgs field 
directly, or to find the physics that replaces it. 


3In quantum field theory, this makes it possible to write an SU(3) x SU(2) x U(1) 
invariant interaction terms involving the Higgs field and the particle creation and annihilation 
operators that becomes a mass term when the Higgs field is replaced by its vacuum value. 
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18.6 Unification and SU (5) 


The SU (2) x U (1) symmetry of (18.4) is a partial unification of the weak and 
electromagnetic interactions. It leaves many striking features of the physics 
of our world unexplained. One of these is the quantization of electric charge. 
The hydrogen atom is known to be electrically neutral to extraordinary accu- 
racy. This implies that there is a relation between the charges of the quarks 
and that charge of the electron. However, (18.5), while it can describe the 
charges we see, does not explain them. The problem is the S generator. The 
values of R3 are quantized because of the non-Abelian nature of the SU (2) 
algebra. However, the values of S are completely arbitrary. They are cho- 
sen to describe the quantized charges we see, but if we could embed both R3 
and S into a simple group, then the values of S, like those of R3, would be 
constrained by the structure of the algebra. 

This idea was a major motivating factor in the search for unified theories 
in the early 70s. At first, we searched for a simple group that unified only the 
SU (2) x U(1) electroweak theory. This search proved fruitless. It was only 
when color SU (3) was included as well that interesting unifications could be 
obtained. 

Thus we ask, is it possible to include the color SU (3) of the strong inter- 
actions and further unify SU (3) and SU (2) x U(1) into a larger algebra, G, 
which is spontaneously broken down to SU (3) x SU(2) x U (1)? 

We will say that a set of creation operators, a},, transforms according to 
the representation (D, d), of SU(3) x SU(2) x U(1) if it satisfies 


|o aj, = aj. [Te ye ’ 
[Raat | =a, Uds (18.10) 
[s, ai] zl. 


Thus z is a color SU (3) index, associated with the SU (3) representation D. 
The r is an SU (2) index, associated with the SU (2) representation d. The 
s is the S quantum number. (18.10) and (18.5) imply that s must be simply 
the average electromagnetic charge of the representation, because for each 
representation, 

TrQ=TrR3+TrS=TrS (18.11) 


(the trace of R3 always vanishes because of theorem 8.9, or equivalently, 
because SU (2) representations are symmetrical about Rg = 0). 

We know the color SU(3) transformation properties of all the particles, 
so we can read off the representations of the creation operators for the right- 
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handed particles from (18.4): 


ul: (3, Lays, d! : (3,113, e(l) 


V! : ($,2) ye, ?': (15,2). 

(18.12) 
where we have indicated the SU (2) representations by their dimensions. The 
full SU(3) x SU(2) x U(1) representation of the creation operators for the 
right-handed particles is thus, 


(3, 1)2/3 6 (3, 1)-1/3 6 (1, 1)-1 © (3,2) 1/06 (1,25. — (1813) 


The creation operators for the left-handed fields transform like the complex 
conjugate of the representation (18.13): 


(3, 1)_2/3 ® (3, 1)1⁄3 6 (1, 1)1 © (3, 2)16 6 (1, 2)-1/2- (18.14) 


where we have used the fact that the SU(2) representation is real, 2 = 2. 
Notice that (18.13) and (18.14) are not the same, and thus the representation 
is complex because of the parity violating habits of the electroweak SU (2) x 
U(1), 

(18.13) is the starting point in the search for unifying algebras. We want 
to find an algebra G which contains SU(3) x SU(2) x U(1) as a subgroup, 
and which has a representation transforming like (18.13) under this subgroup. 
The rank of G must be at least four if it is to contain the four commuting 
generators, T3, Ts, R3 and S. The simplest possibility is to try the rank four 
algebra, SU (5). We will see later that the other simple rank 4 algebras could 
not possibly work because they do not have complex representations, and so 
there is no way they could describe the complex representation, (18.13). 

SU(5) has a five dimensional representation of course. It actually has 
two, because the 5 is complex, and thus the 5 and 5 (or [1] and [4]) are not 
equivalent. Can we find an SU (2) x U(1) subgroup of SU (5) such that the 5 
transforms like some five dimensional subset of the creation operators? The 
only possible such subset is 


(3, 1)-1/3 6 (1,2)1/2- (18.15) 
The five dimensional subset, 
(3, 1)a/3 6 (1,2)i/2, (18.16) 


cannot work because the generator S is not traceless on it. Thus by theorem 
8.9, S could not possibly be an SU (5) generator for this choice. 
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It is straightforward to embed SU (3) x SU (2) x U(1) in SU(5) to obtain 
(18.15). Take the SU(3) generators to be traceless matrices acting on only 
the first three indices in the 5, 


T, 0 
( 0 ij ; (18.17) 
and the SU (2) generators to be the traceless matrices acting on the last two, 
0 0 
( 0 R, ) (18.18) 
Then S is the generator that commutes with both (18.17) and (18.18), 
-I/3 0 
( 0 I 2) (18.19) 


Thus we can put the d! and 2! creation operators into an SU(5) 5, n. as 
follows: 


Al — dl, forz =1to3; 
al=l sgt, (18.20) 
tat! aot, 


What about the rest of (18.13). What remains is ul, et, and pl which 
transforms as 
(3, 1)2/3 e (1,1). 6 (3,2) 1e. (18.21) 


This representation is 10 dimensional. SU (5) has a 10 and a 10, the [2] and 
[3] representations. The 10 is an antisymmetric tensor product of two 5’s, 
so we can determine how it transforms under the SU(3) x SU(2) x U(1) 
subgroup by taking the antisymmetric product of (18.15) with itself (you can 
easily check that this is equivalent to using the general rule described in chap- 
ter 13). The SU (3) and SU (2) representations compose in the standard way. 
The S quantum number simply add. Thus, 


[(3,1)-1/s ® (1, 2)1/2] & [(3,1)-13 9 0523s] as 


= (18.22) 
= le 1)_2/3 € (1,1)1 ® (3, 2)1/6| . 


This is just the complex conjugate of the representation (18.21). Thus what 
we actually want is the 10 representation, and then we can write the remaining 
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right-handed fermion creation operators in an SU (5) representation antisym- 
metric in two upper indices, £^! = —£*71, with 
EDI — Dcu} fora b c =1to 3, 
gott = ps! — u^! fora=1to3, 
£o5t 2) 2d for a - 1to3, 
£45t — ef, 


(18.23) 


This is the standard SU(5) model, with the creation operators for the 
right- handed particles transforming like 5 & 10, or equivalently, with the 
creation operators for the left-handed particles transforming like 5 @ 10. The 
most interesting thing about it is the way that everything fits. 


18.7 Breaking SU (5) 


There are two issues that must be addressed in discussing the spontaneous 
breaking of SU(5) symmetry. 


1. How is the symmetry broken down from SU(5) to SU(3) x SU(2) x 
U(1)? 


2. How do the quarks and leptons get mass? 


Let us answer these in the Higgs field language. 

There is a very simple solution to the problem of breaking SU(5) down 
to SU(3) x SU(2) x U(1), which in fact is very much analogous to Gell- 
Mann's solution to the problem of breaking SU (3) down to the SU (2) x U (1) 
of isospin and hypercharge. The S generator in the adjoint representation has 
all the desired properties for the vacuum value of a Higgs field. Just as the 
hypercharge generator in the adjoint representation of SU (3) commutes with 
isospin and hypercharge, so the U(1) generator, S, in the adjoint of SU(5) 
commutes with SU(3) x SU(2) x U(1). Thus the 24 representation with 
a vacuum value in the S direction is a good choice for the Higgs field that 
breaks SU (5) down to SU(3) x SU(2) x U(1). 

Two conditions must be satisfied in order for a Higgs field to give mass 
directly to the fermions in an SU (5) theory? The Higgs representation (or its 


“Unlike the situation in the Higgs doublet breaking SU(2) x U(1), there are physically 
inequivalent directions in the adjoint of SU (5). One can show that the S direction is a possible 
vacuum value. 

>More complicated, indirect mechanisms involving quantum mechanical “loop” effects are 
also possible, but we will not discuss these. 
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complex conjugate) must appear in the tensor product of the SU (5) represen- 
tations in which the fermion and its antifermion appear. And the Higgs rep- 
resentation must have a component that transforms under SU (3) x SU (2) x 
U(1) like the Higgs field of the SU(3) x SU(2) x U(1) model or its complex 
conjugate. 

The right-handed positron (€) and d quark fields are in the 5, while their 
antiparticles, the electron and d fields are in the 10. In SU(5), 5 & 10 is 
[1] & [3] = [4] @ [3, 1], or 5 & 45 (see problem 13.B). 

From (18.15), you can check that the 5 contains a component that trans- 
forms like the SU (3) x SU(2) x U(1) Higgs field — the (1, 2)1/2. You can 
check (see problem 18.E) that the 45 has the desired component as well. 

Thus the electron and d quark masses can come from either of the two 
Higgs representations, 5 or 45. 

The right-handed u and are both in the 10, and 


[3] & [3] = [1] + [4,2] + [3,3] = 5 + 45 + 50 (18.24) 


We already know that both the 5 and 45 of SU(5) Higgs fields have the 
appropriate component to act as the SU(3) x SU(2) x U(1) Higgs field, 
thus they can give mass to the u quark. The 50, however, does not have any 
component transforming like (1, 2)+1/2 under SU (3) x SU(2) x U(1) (see 
problem 18.E). Thus this representation is not useful for giving mass to the u 
quark. 


18.8 Proton decay 


Another fascinating thing about the SU (5) theory is that quarks, antiquarks 
and the electron all appear in the same irreducible representation. Because of 
this, some of the SU (5) interactions do not conserve baryon number. Thus 
SU (5) unification leads to proton decay. The proton is known to be extremely 
long-lived. But if the vacuum value of the 24 Higgs field is extremely large, 
then the interactions that cause proton decay are short range and the probabil- 
ity of two quarks inside the proton interacting to cause the proton to decay is 
very small. It turns out that we know approximately what the vacuum value 
of the 24 must be in order to explain the observed differences between the 
color SU (3), electroweak SU(2) and U(1) forces.® Thus the rate of proton 
decay can actually be predicted in a given SU (5) model. Since SU(5) was 
first found theoretically, experimenters have looked for proton decay with 


*This is not something that can be understood using group theory alone. It involves the 
dynamics. 
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more and more sensitive experiments, so far without success. In fact, the 
simplest version of the SU(5) unified theory is fairly convincingly ruled out 
by these experiments. But slightly elaborated versions of the SU (5) theory, 
particularly supersymmetric versions, are still extremely promising. 


Problems 


18.A. Check explicitly that mass terms for the electron and the u and 
d quarks are allowed in the SU(2) x U(1) model in which the symmetry is 
broken by the Higgs field of (18.7). Hint: see the discussion on page 230. 


18.B. Find the symmetric tensor product of (18.15) with itself. 
18.C. Do the same for (18.21). 
18.D. Consider the operator 

O = e eu upd, 


where u and d are quark annihilation operators. Show that if the operator 
O appears in the Hamiltonian, it has the right charge and color properties to 
allow a proton to decay into a 7° and a positron. 


18.E. How do the 45 ([4,2]) and 50 ([3,3]) of SU(5) transform under 
the SU(3) x SU(2) x U(1) subgroup? Hint: it may be easier to answer the 
question for the complex conjugate representations and then complex conju- 
gate. 


Chapter 19 


The Classical Groups 


The are four infinite series of simple Lie algebras, generating what are called 
the classical groups. The first of these consists of the SU(N) algebras that 
we have studied already. Their Dynkin diagrams have the form 


O-O---O-O a9 


The rank n algebra SU (n + 1) was called An by Cartan, who first classified 
these things. In this brief section, we will go over the others very lightly, 
before we do the complete classification. Later, we will discuss each of them 
in detail. 


19.1 The SO(2n) algebras 


The orthogonal 2n x2n matrices form the group of rotations in 2n-dimensional 
real space. The group is generated by the imaginary antisymmetric 2n x 2n 
matrices, of which n(2n — 1) are independent. We can choose the Cartan 
generators as follows: 


[Hm]jk = —i(052m—16k2m — 9k 2m-19j2m) (19.2) 


This amounts to breaking up the 2n dimensional space into n different two 
dimensional subspaces on which the different Cartan generators act. H,, is 
a little Pauli matrix, c», in the appropriate 2 dimensional space, with zeros 
every where else. The eigenvectors of c» are given by 


e T) (a= i (19.3) 
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Thus the eigenvectors of the H,, are 
(|te*)]; = 9521 £ d j 2k (19.4) 


satisfying 
Hm |te*) = êkm |+e*) (19.5) 


Thus the weight vectors are + the unit vectors e* with components 
D EL (19.6) 


(as usual, you must not confuse the space on which the generators act with 
the Cartan space in which the weight vectors live). 

The roots connect one subspace with another in all possible ways, so that 
the roots vectors are 


roots te e for j#k 
positive roots ef +e* for j < k (19:) 
simple roots e) — eit! for j —1...n—1 
and e"71 + e^ 
The Dynkin diagram is 
gh7l gn 
O-O- 2 (19.8) 
elg? enl 4 gn 


The algebra SO(2n) was called D, by Cartan. 


19.2 The SO(2n + 1) algebras 


For a 2n + 1 dimensional space, we can find n two dimensional subspaces, 
but there is a one dimensional subspace left over. Thus for the rotation group 
in 2n +1 dimensional space, generated by the imaginary antisymmetric (2n + 
1) x (2n + 1) matrices, we can choose the Cartan generators as before, and 
there will be weights of the form +e* corresponding to vectors in the kth two 
dimensional subspace. But there is also a 0 weight, associated with the extra 
dimension. Again, just as before, there will be roots connecting the various 
two dimensional subspaces, with the same roots as in SO(2n). But now there 
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are also roots connecting the extra one dimensional subspace with the others, 
with roots te’. Thus the roots are 


roots tei te* for jZk and +e 
positive roots e) te* for j < k and ef 

dL (19.9) 
simple roots e) — e*t! for j —1...n—-1 

and e" 


The root e”~! + e" is not simple in SO(2n + 1) because it is a sum of 
e”~! — e" and twice e”. The Dynkin diagram is 


O-O---O-0-O (09.10) 
e” 


e! — e? 


The algebra SO(2n + 1) was called B, by Cartan. 


19.3 The Sp(2n) algebras 


Finally consider the 2n x 2n dimensional hermitian matrices which are tensor 
products of 2 x 2 matrices and n x n matrices, of the following form: 


16A, 019 5,, 02 9 $5, 703 Q $3 (19.11) 


where the c, are the Pauli matrices, A is an antisymmetric n x n matrix, 
and S1, S2 and S3 are symmetric n x n matrices. You can check that these 
close under commutation because of the properties of the Pauli matrices. The 
subset 


19A, 703 Q S3 (19.12) 
for traceless S3 generate an SU (n) subalgebra of Sp(2n) of the form 
T, 0 
( à ES (19.13) 


which generates the reducible representation n ® 7 with weights v and —v/ 
where the vJ vectors are defined in (13.9). We will take the first n —1 elements 
of our Cartan subalgebra to be the Cartan subalgebra of this SU (n) subgroup, 
Hi, Ho... H4.,. The final element of the Cartan subalgebra is then 


Hn = 03 @1/V2n (19.14) 
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All the generators of the SU (n) subalgebra commute with Hp, thus the 
roots of the subalgebra have H, = 0. We already know that the other com- 
ponents are given by the SU(n) root vectors, v? — v*. The other roots corre- 
spond to the matrices (c1 + i02) & Ske where 


[Skeliz = (Sik je + Sie 95k) (19.15) 


with Hn = +,/2/nand Hm = [vă + v4. 

I should say that this is a pretty stupid way of writing the roots. Later, 
when we talk about Sp(2n) in more detail, we will introduce a better notation 
in which all these relations are more obvious. Anyhow, if we take v”*! to be 
a unit vector orthogonal to all the v) for j = 1 to n, the roots can be written 
as 

roots vi — vf for j X k 


and + (ri Tta Vae) 
positive roots vi — v* for j «k 

and (wj +4 + /2yn*1) (19.16) 
simple roots y) — vitl for j=1...n—1 


and (^ + 2 pnt) 


The Dynkin diagram is 
O-O---0O-CcO (9.19 
vl- ry? 2u” + Jin 


where the S'U (n) roots are shorter. The algebra Sp(2n) was called Cn by 
Cartan. 


19.4 Quaternions 


A useful way of thinking about the classical groups is as rotations groups in 
various spaces. Obviously, SO(N) is the group of rotations in a real, N di- 
mensional space. The group generators, X, in the defining representation are 
pure imaginary. The product iX that is exponentiated to get the representa- 
tion of the group elements is completely real and antisymmetric. Similarly, 
SU(N) is the group of “rotations” in a complex N dimensional space that 
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preserve the norm of complex vectors. The real part of iX in the defining rep- 
resentation is antisymmetric, and generates an SO(N) subgroup of SU(N). 
The imaginary part of iX is symmetric and traceless. 

Finally, Sp(2N) can be thought of as the group of rotations in a quater- 
nionic N dimensional space, where the elements are quaternions, objects of 
the form N 

B4j.6 (19.18) 


where the components of j are the quaternionic units, satisfying 
Jajo = —Óab + €abc)c (19.19) 
Thus a vector in the N dimensional quaternionic space looks like 


q! B! +j5-C1 
Q=| : |= f (19.20) 
q^ BN +5-ON 


In this language, iX has an antisymmetric part 7A, plus three independent 
“complex” symmetric components, 


A91, j2S2, jaSs, (19.21) 


proportional to the three quaternionic imaginary units, so that the ¿X have 
the form 
tA, jisi, joSe, Jja93. (19.22) 


To see the connection of the quaternionic description of Sp(2N) with 
(19.11), note that the quaternionic units j are equivalent to —7 times the Pauli 
matrices, 

J> ið, (19.23) 


so that (19.21) is related to the terms in (19.11) proportional to the Pauli ma- 
trices. To see how the Sp(2N) transformations in a 2N dimensional space 
are equivalent to quaternionic rotations, consider simply building the N di- 
mensional space of quaternions in terms of the Pauli matrices, so that the 
quaternionic N -vector becomes a 2N x 2 matrix of the form 


Bl-ic] -Cl-ici 
B! — id. C! Cl-ic] B'+ic} 
Q> : = : : (19.24) 
BN — ig. ON BN iC" -ON -icN 
CN -iCN BN 4 ic 
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The action of (i times) the Sp(2N) generators on the complex 2N-vector 
defined by the first column of (19.24) is equivalent to the action of (19.22) on 
(19.24). 

What makes this construction work is that the real numbers, the complex 
numbers, and the quaternions share a very important property. For all of 
them, an absolute value, |z|, can be defined with the properties that 


|z1 zal = |z1| |ze| Zi = 2 = 0 (19.25) 


You are familiar with how this works for the real numbers and the complex 
numbers. For the quaternions, the absolute value is 


|B +7- ë| =\B?+Č? (19.26) 


Note that it satisfies 


|p« i-o = (B- 5.0) (B+j-6) - (B -O) (B+7-6) 
(19.27) 
where complex conjugation is defined as usual as changing the sign of the 
imaginary parts (all three of them in this case). 
The existence of the absolute value with these properties means that there 
is a natural norm defined on the N dimensional quaternionic vector space: 


N 
Ol? = >> ig? (19.28) 
j=l 


Sp(N) is the group of rotations that preserve this norm while preserving the 
structure of the quaternionic space. 

One might wonder why there are not other possibilities for classical 
groups. The reason is that it is not so easy to satisfy the condition (19.25). 
This condition, plus the requirement that addition, subtraction, multiplica- 
tion and division (except by 0) can all be consistently defined, is the defining 
property of a structure called a division algebra. There is only one other 
such structure, the octonians, with seven imaginary units, which we will dis- 
cuss further in Chapter 27. Unfortunately, the multiplication law for octoni- 
ans is not associative, so in general, rotations in an octonionic space do not 
form a group. However, the peculiar Lie algebras that do not fit into any of 
the classical groups are all related to octonions on one way or another. We 
have already seen the algebra G2, and we will discuss some of these bizarre 
algebras further in chapter 27. 
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Problems 


19.A. Consider the 36 matrices, 
Oa; Ta, Nas,  CaTb!]c 


where c, 7, and 7 are independent sets of Pauli matricies. Show that these 
matrices form a Lie algebra. Find the roots, the simple roots and the Dynkin 
diagram. What is the algebra? 


19.B. Consider the 28 matrices, 
Oa; Ta, 73; Canl, Can, Ta ; Ta?]2 ; Oa7b7)3 - 


where c, 7, and 7 are independent sets of Pauli matricies. Show that these 
matrices form a Lie algebra. Find the roots, the simple roots and the Dynkin 
diagram. What is the algebra? 


Chapter 20 


The Classification Theorem 


We will now classify all of the simple Lie algebras following the argument of 
Dynkin. We will do this with simple analytical geometrical arguments based 
on the master formula, (6.36). It may be that mathematicians can visualize 
the meaning of the arguments in multidimensional space. I can't do that. But 
the analytical arguments are simple enough. 


20.1 Il-systems 


We know that the simple roots of any simple Lie algebra have the following 
properties: 


A. They are linearly independent vectors. 
B. If a and £ are distinct simple roots, 2o - 8/a* is a non-positive integer. 


To ensure that a system of roots satisfying A and B yields a simple Lie alge- 
bra, we need one additional condition: 


C. The simple root system is indecomposable. 


A system of roots is decomposable if it can be split into two mutually or- 
thogonal subsystems. A system is indecomposable if it is not decomposable. 

It is easy to see that for decomposable simple-root systems, the simple 
roots in the two orthogonal subsystems commute (p and q are zero for all 
pairs), and the entire system of roots splits into two commuting subsets. Each 
subsystem, along with the Cartan generators associated with the subspace it 
spans, forms an invariant subalgebra. The group associated with a decom- 
posable root system is not simple. However, it is semi-simple, which means 
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that is has no Abelian invariant subalgebra. Because the subalgebras com- 
mute, the groups they generate also commute. A group of this kind that is 
built out of two commuting subgroups, G and G^», is called a direct prod- 
uct of the two subgroups, G = G4 x G2. An elementary example is the 
algebra you studied in problem 8.B, in which the Dynkin diagram consisted 
of two disconnected circles, associated with the group SU(2) x SU (2). All 
the semisimple Lie groups are direct products of simple Lie groups. 

Dynkin calls a system of vectors satisfying A, B and C a II-system. All 
we need do to classify the simple Lie algebras is to classify the possible I- 
systems. This is just geometry. We know that the system of vectors can be 
constructed (up to a conventional normalization) from the Dynkin diagram, 
and we will use the term II-system to refer interchangeably to the system of 
vectors or the Dynkin diagram that represents it. In terms of Dynkin dia- 
grams, B is just automatic, because the diagram just determines the integers; 
C is the condition that the diagram cannot be taken apart into two diagrams 
without cutting any lines; and A is equivalent to the condition that all the 
simple roots are positive, from which we showed that linear independence 
followed if B is satisfied. 

Let us now do some geometry. 

lemma 1. The only II-systems of three vectors are 


O-O-O m O-OcO e» 


This follows from the simple fact that the sum of the angles between any 
three linearly independent vectors is less than 360°. The only possible angles 
in a II-system are 90°, 120°, 135° and 150? and only one 90? degree angle 
is allowed by indecomposability, so these are the only possibilities. We will 
later see that there are three important systems that satisfy B and C, but not 


A: 
OCO m OOO a 


and 


These do not satisfy A because the sum of the angles is 360° so the three 
vectors are coplanar and thus not linearly independent. 

In fact, lemma 1 is an incredibly strong constraint, because if we take 
from any Dynkin diagram a connected subset of circles, the result is another 
Dynkin diagram. This is true because the lengths and angles are unchanged 
and (because of B) the subset of vectors must also be linearly independent. 
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Thus any indecomposable subsystem of a II-system is also a II-system. Thus 
any three connected vectors in any II-system must be in one of the two forms 
in (20.1). 

A very simple corollary of this is that no triple line can appear in a II- 
system of three or more vectors. Thus the only II-system containing a triple 


line is 
CEO 20.3) 


corresponding to the algebra Go. 

lemma 2. if a II-system contains two vectors connected by a single line, 
the diagram obtained by shrinking the line away and merging the two vectors 
into a single circle is another II-system. 

Let a and 8 be the two vectors and T the set of all the other vectors in the 
II-system. Because of lemma 1, we know that I' contains no vector connected 
to both a and 8. Thus if a vector y is connected to a, then y - 8 = 0, and if 
^y is connected to 8, then y- œ = 0. We also know that a + 9 has the same 
length as o and 8. Therefore 


if y € T is a vector connected to a, 


then y - (a+ B) —2 y: a 
yY:(a- B) =7 (20.4) 
if y' ET is a vector connected to £, 


then y: (a+ 8) — 3 8 


So the set a + 8 and T is the shrunken II-system. 

Lemma 2 has two important corollaries. a. No II-system has more than 
one double line; and b. no II-system contains a closed loop. Either configu- 
ration could be shrunk into conflict with lemma 1. 

lemma 3. If the configuration 


(20.5) 


is a II-system for some subdiagram, A, then 


A 


is also a II-system. 
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To see this, label the vectors as follows: 


We know that o - 8 = 0 and 


2a-y  2a-¥7 
dr. cce 20.8 
28-y  28-v (20:8) 


(20.9) 


Thus the following is the Dynkin diagram for the shrunken II-system: 


A corollary is that the only branches in a II-system have the form of three 
single lines coming from a central circle, like 


e (20.11) 


because anything with more branches, such as 


(20.12) 


(20.13) 


or a double line such as g 
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can be shrunk to 


C eo 2014) 


which is not a II-system because of lemma 1. 

Similarly, no Il-system contains two branches. 

This is as far as we can go with general theorems. We must now consider 
some peculiar special cases, associated, as we will see, with what are called 
the exceptional groups (meaning that they don't fall into any of the infinite 
families discussed in the previous chapter). 

lemma 4. No II-system contains any of the following diagrams: 


(c) 


O-O-O2O-O G 


All of these fail the test of linear independence, because we can find a set 
of numbers ji; such that, if the vectors in the diagram are a/, the sum 


2 
[= bj J =0 (20.15) 
j 


The appropriate values of u; are indicated inside the corresponding cir- 
cles below: 
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2) 
Oz F T aL aa. (b) 


(3) 
O—2—9—90—09-—9—90-—() (o 


In (d), the vectors do not all have the same length. The two on the right 
may be either longer or shorter than the others by a factor of v2. If they are 
longer, use 


OGTR) (d) 


If they are shorter, use 


OmOmOmOLO (a) 


We will see later that all these diagrams can be constructed with linearly 
dependent vectors. 


We now have enough information to complete the classification theorem. 
All II systems must belong to one of 4 infinite families, or be one of 5 excep- 
tional diagrams. All these are shown below, with shorter vectors indicated by 
filled circles. 
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C €—O 
^h 9-9 0-O 


(20.16) 


Nothing else is allowed. Everything else runs afoul of one of the geomet- 
rical constraints we have derived. There are no other infinite families. Adding 
more circles beyond the double line in B, or C; runs into (d), except for F4. 
Continuing beyond the branch in D;, runs into (a), (b) or (c), except for the 
E; exceptionals. And we already knew that G2 is the unique diagram with a 
triple line. 

Note some equivalences between the algebras. A1, Bı and C4 all consist 
of a single circle, and thus all describe the algebra of SU(2). In addition, 
B3 = C$. The D, family is a bit odd. As you go down in n you keep 
removing circles from the left. D4 looks like 


(20.17) 


oe (20.18) 


and D3 degenerates into 
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and thus D3 = A3. Note, further, that if we remove one more circle from D3 
to get D», it falls apart into two disconnected circles (the middle one must 
be removed to stay in the D, family). Thus D2 is not simple. This is the 
statement (an important one) that the algebra of SO(4) is the same as the 
algebra of SU(2) x SU(2) (problem 8.B). 

That is the complete list of such coincidences. All the others are distinct 
algebras. 


20.2 Regular subalgebras 


A regular subalgebra, R, of a simple Lie algebra, A, is a subalgebra such 
that the roots of R are a subset of the roots of A and the generators of the 
Cartan subalgebra are linear combinations of the Cartan generators of A. A 
regular subalgebra is called maximal if the rank of R is the same as the rank 
of A (in which case the Cartan subalgebras are identical). 

If we leave a circle out of a Dynkin diagram, the result is a diagram or 
two disconnected diagrams. These are associated with a regular subalgebra 
of the original algebra, associated with a subset of the original simple roots. 
Thus for example, the SU (n) x SU(m) subgroup of SU (n + m) is obtained 
in this way. We can repeat the procedure and find other regular subgroups. 
However, these subalgebras have lower rank than the starting group. The 
Cartan generators that have been left out generate U (1) factors of the sub- 
algebra. There are also non-maximal but regular subalgebras which cannot 
be obtained by leaving out a circle, but instead are obtained by the merging 
procedure that we used to establish the classification theorem. 

To find the semisimple maximal regular subalgebras, we can use the fol- 
lowing trick. Add to the system of simple roots, o? for j = 1 to n, the 
lowest root, o2. Because o is the lowest root, al — a is not a root, and 
therefore, by the usual argument, 


209 - ad 2a° . ad 
and 


2 ai? 


7 (20.19) 

a 
are non-positive integers. Therefore, this system of vectors satisfies the condi- 
tions for a II-system except that there is one linear relation among the vectors. 
This is called an extended II-system or an extended Dynkin diagram. 

If we remove any vector from an extended II-system, the remaining vec- 
tors are linearly independent. They still satisfy the master formula. They are 
therefore the simple roots of a regular, maximal subalgebra of the original 
algebra. However, the system may not be indecomposable, so the subalgebra 
may be semi-simple. 
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There is a unique extended II-system for each II-system, because given 
a Dynkin diagram, we can find the lowest root explicitly. In fact, we have 
already discussed all of the extended II-systems in the discussion of the clas- 
sification theorem, because they are just the systems that failed the test of 
linear independence, but were otherwise OK. Here they are: 


diagram extended diagram 
An O-O--O-O e An 
B. — O-O--O-C-9 300-09 Br, 
Cn — e-e--9-9-0 cc9-e--e-e-o Cn 
Dn OS ie “0. : o Di, 
(20.20) 
Go =? 0-0 G 
Fy eeo F; 


eec 
Es n deus Eg 


Er TE T E E E; 
Es S PM E. 


There are some exceptions for small n. A; = Bı = Cı = SU(2) and 
D» = SU(2) x SU (2) cannot be extended without changing notation because 
the highest and lowest roots of SU (2) are negatives on one another, and we 
have not included a notation for 180? in our definition of Dynkin diagrams. 


It should probably be 
(CX (20.21) 


The other exceptions are Bz and Ds, for which the indicated extensions 
do not exist because there are not enough open circles to allow for branching. 
The corresponding extended II-systems are actually C4, and A5. 

Let's work out some other examples. 

The first thing to notice is that A,, doesn't have any semi-simple regular 
maximal subalgebras. Removing any circle from Aj, just takes you back to 
Án again. 

For the B,,s, removing an open circle from the left end of B7, gives you 
back Bn again. But removing the filled circle from the right end of By, gives 
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Dn. This corresponds to the fact that SO(2n+1) has an SO(2n) subgroup. If 
instead we remove a circle from the middle of By, (for large enough n that the 
middle exists), the diagram falls apart into a Bj and a D; X, corresponding 
to the SO(2k) x SO(2n — 2k + 1) subgroup. Each of these can be further 
broken down, as well. In general, to enumerate all the semisimple regular 
maximal subalgebras, you must continue breaking things up into subgroups 
until you get to subalgebras that cannot be further broken down. 


For the Cn, removing an open circle from either end just gives you back 
Cn. Removing a filled circle from the middle breaks up Cn into C; and 
C,,—. Finally, removing the first of last filled circle gives A; x C4... This 
is really the same thing again, because Ay = SU(2) is the same algebra as 
Cı = Sp(2), so this is just a degenerate special case of the same analysis. 


For Dn, removing a circle from either end gives back D, so all we can 
do is to remove a circle from the middle to get Di x D, ., corresponding to 
SO(2k) x SO(2n — 2k). 

G2 is small enough that it is easy to enumerate all the possibilities. Re- 
moving the open circle from the right of G gives back Gz. Removing the 
filled circle from the left gives 4; = SU(3). Removing the middle circle 
gives SU(2) x SU(2). 

F4 has a B4 subgroup, obtained by removing the filled circle from the 
left. It has an A, x As subgroup obtained by removing the second filled 
circle. It has an Aj x A5 subgroup obtained by removing the left-most open 
circle. And it have a C3 x A, subgroup obtained by removing the penultimate 
open circle. 


I'll stop here to leave some questions to ask on problem sets and exams. 


20.3 Other Subalgebras 


The general subject of the subalgebras of an algebra is quite complicated. But 
the principle is a simple one. For each algebra, there is a simplest representa- 
tion, out of which all the other representations can be built. If you know how 
this representation transforms under a subalgebra, you can determine how 
any representation transforms. Conversely, each possible transformation of 
the simplest representation is associated with a different subalgebra. 


We will come back to this notion several times in the following chapters. 
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Problems 

20.A. Prove that decomposable II-systems yield decomposable root sys- 
tems. 

20.B. Find the regular maximal subalgebras of Eg. 


20.C. Find the regular maximal subalgebras of SO(12). To find them all, 
you will have to apply the extended Dynkin diagram algorithm several times, 
because some of the regular maximal subalgebras themselves have nontrivial 
regular maximal subalgebras. 


Chapter 21 


SO(2n + 1) and Spinors 


The SO(N) algebras have a fascinating property which is worth exploring 
— spinor representations. 


21.1 Fundamental weights of SO(2n + 1) 


Label the generators of SO (2n + 1) as 
Mab = —My fora, b —1to2n4 1 (21.1) 
In the 2n + 1 dimensional defining representation 
[Mas]ay =-1 Z Oby = box Say ) (21.2) 
It is easy to find the commutation relations 
Mav, Mea 
ares (21.3) 
— cc (5i. Moa — Sac Moa — 56a Mac + bad My.) 
This is the standard form for rotation generators in a real vector space. The 


commuting generators in the Cartan sub-algebra we took to be 


Hj = M»j15j for j =lton (21.4) 
We can take the roots to be 


1 
Epei = p-n tin M»jonj1) 


Epi eyes = 3(Maj-1,2k-1 + in Mojox-i (21:9) 
tin Moj-iok — n7 Mo; 2%) 
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where n, 7' = +1. These satisfy 


[Hj, Es] = n [e"]; Eo n6 Ener (21.6) 


so that the simple roots are 


o) = e — eit! forj—lton-1l 
(21.7) 


o = e” 


corresponding to the diagram 


O-O- -0-00 aus 


"LL o? exe n-l gn 


The fundamental weights are 


. j 
TR e forj-1ton-1 
Kei (21.9) 


The last one is different because of the different normalization of a”. This 
last representation is the spinor. By Wey] reflections in the roots e, we get 
from u” the set of weights 


1 
5 (te Ete)... +e") (21.10) 


All of these are unique, because they are equivalent by reflection to the high- 
est weight state. Furthermore, each could be the highest weight for some 
other definition of positivity, thus there are no other weights and the repre- 
sentation is 2” dimensional. It is convenient to treat the 2” dimensional space 
as a tensor product of n two dimensional spaces. Then an arbitrary matrix 
in the space can be built as a tensor product of Pauli matrices. Call the Pauli 
matrices for the jth space c7, so that 


|te'/2 + e?/2---+e"/2) 
= |t+e'/2) 8 |4e?/2) @--- @ |4e"/2) (21.11) 


ci |z e) EE |z' e) [Cala'a 
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where x, z' = +1/2. 
In this notation, the Cartan generators are 


l ; 
H; = 373 (21.12) 
These all satisfy 
1 
2: — 
H; = " (21.13) 
We could have chosen any M, to be a Cartan generator, so it is clear that 
2 1 
Mas = 5 (21.14) 


in this representation for any a Æ b. Now consider the roots 


1 . 
E, = yg iim + iMzj2n+1) (21.15) 


Because we can only raise a state in the representation at most once, we must 
have 


(Ea) =0 (21.16) 
Expanding out (E,;)?, you can see that this implies 
{Mo;~1,2n41, M252n41] = 0 (21.17) 
Again, the particular choice of axes is arbitrary, so we must have 
(Mj,My) =0 for jzk£zt£zj (21.18) 


Let us now construct the Es explicitly. Because E, is a raising operator, 
we know that 
E, |-el/2 + zoe?  --- tne”) 
= f(x9,--+,£n) |el /2 + ree” +--+ tne") (21.19) 
E. |-el/2 + zoe? +--+ tne") =0 


for some function, f. We can compute the norm of f using standard tricks. 


| (w2,--- an)? = (7e! /2 + age” + --- tpe™| 
E AE, |-6l/2 + 256€? +--+ tne") (21.20) 
= (-elf2 + -- -| CE a, Ea) |—e'/2 +--+) 
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but using (21.14) and (21.17) we see that! 


1 
Ua Ea) = 5 (21.21) 
and thus " 
If(2. m) = 5 (21.22) 
Now for each z5,::-,z4, we can choose the relative phase of the states so 


that f is positive. We can do this by adjusting the phases of the states with 
zı = —1/2. Then 


1 
T2,°°°, T = —— 21.23 
F( 2 n) V2 ( ) 
independent of z. Comparing with our definition of o1, we can write 
ld 
Era = 30i (21.24) 


Let us now apply the same argument to E4e2. 


Erie! — e?/2 +--+ ane") 


= fo(z1,23,::-, Tn) lae! + e”/2 To Ene”) (21.25) 
1 
where |f2(z1, 23, zs)? "s 


for some function, f2. For x; = 1/2, we can choose the relative phases of 
the states to make fy = 1/ v2, as before so that 


1 

E,2\e! /2 — e?/2 +--+) = — lel /2 + e [2 +) (21.26) 
V2 

But now we cannot change the phases of the x; = —1/2 states, because 


we have already fixed them in the previous argument. In particular, we know 
that ( E i, E+e2} = 0, and therefore we can do the following computation 


Ej|-e'/2 — e? [2 4...) 

= V2 Be E je! /2 - e [2  ---) 

= —V2 E a Ele! /2 — €? [2 4...) (21.27) 
= —E_ale'/2+¢7/2+---) 


1 
gree +e7/2+---) 


'We could equally well use the fact that [E,1, E..,1] = a. Ë. 
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That is we need an extra minus sign for the —e!/2 state. Or in terms of o7 
matrices, 


1 
Eze = 37 oÈ (21.28) 


where the tensor product is understood, as usual. 

This is easy to understand. The c1 is there to ensure that E: and E,» 
anticommute. Since E,,: is built out of a} and o, E,» must be propor- 
tional to a. 

Continuing in exactly the same way, we find 


Exo = oot et 
e (21.29) 
Big = 504 of ol 
For the hermitian generators, we can then write 
Moj-i2n4i = T eeo oi 
1 (21.30) 
Moj2n4i = 573 a loj 


Now we know everything, because we can constuct all the other genera- 
tors by commutation: 


Map =-1 [Ma2n+1, My 244] (21.31) 


for a,b A 2n + 1. Each Moy is just +1/2 times a product of Pauli matrices. 
This representation is called the spinor representation of SO(2n + 1). 
As you see, it is a generalization of the spin 1/2 representation of SO(3) (or 


SU(2)). 


21.2 Real and pseudo-real 


From (21.10), we can see that the spinor representation, 4", is equivalent to its 
complex conjugate. Nevertheless, the spinor representations have interesting 
properties under complex conjugation. Before discussing this specifically, we 
will address the problem more generally. 

Suppose that T, generates a real irreducible unitary representation of a 
simple Lie algebra, so that 


Ta = —RT* R! = —RT! RO (21.32) 
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We can prove that E is unique up to a trivial scale factor. For suppose that 
there is another non-singular matrix, Q, that also satisfies 


T,--QT;Q^ (21.33) 
Then 
T2070 (21.34) 
and thus 
T cBRQ TOR! (21.35) 
or 
es RQ" =0 (21.36) 


for all a. But if a matrix commutes with all the generators of an irreducible 
representation, Schur’s Lemma implies that it is a multiple of the identity, and 
thus 


RQ'!-AI R=\AQ (21.37) 
and thus R is essentially unique. 
Now let us use this result to show that R must be either symmetric or 
antisymmetric. Because the T, are hermitian, we can write 
T,--RITR3, TT--R UR 
>T, = RR" T, RT R! (21.38) 
= [n RT Ro] 20 
Therefore, using Schur’s Lemma again, we conclude that RTR! = XI or 


RT -AR (21.39) 


But since transposing twice gets us back to where we started, we must have 
A? = 1, or À = £1, which is the desired result. Either 


R-A--AT o R=S=S™ (21.40) 
Thus there can be two kinds of real representations. If R = 5, the represen- 


tation is called real-positive, or just real. If R = A, the representation is 
called real-negative, or pseudo-real. 
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21.3 Realrepresentations 


To see what this difference means in practice, suppose that T; is equivalent 
to a representation consisting of purely imaginary antisymmetric matrices, 


qp'eqgUppmp. T} =-T!* (21.41) 


The corresponding group representation is completely real, since it has the 


form ast 
eic Ta (21.42) 


In this case, we can compute the matrix R — 
aif z 
T! = -T!? = -UT T, T U" =U T, U 


(21.43) 
=> Ta = ~U UT T* (U UT)! 
Thus the matrix R is the symmetric matrix U UT. 

The converse is also true. Any irreducible representation of a simple Lie 
algebra with a symmetric R is equivalent to a representation that is pure imag- 
inary. Let's see how this argument goes. All the irreducible representations 
are equivalent to unitary representations, so we will assume that the genera- 
tors are hermitian. Then it is easy to see that R is proportional to a unitary 
matrix. We can write the equivalence condition as 


-TI = RT, R = Rİ T, R" (21.44) 
where the second equality follows from Hermitivity. But then 
RRt Ta = Ta R RÝ (21.45) 


and R Rt œ I by Schur's Lemma. The overall constant just cancels out in 
the similarity transformation, so we may as well take R to be unitary. Now 
suppose that it is also symmetric. It is a linear algebra fact that we can write 
a symmetric unitary matrix as a product of unitary matrix and its transpose? 


R=VVT (21.46) 
But then the generators 
T! =V T, V (21.47) 
are antisymmetric — for example, we can multiply the reality condition (21.44) 
on the left by VT and on the right by V 17 to obtain 
-VITTI VT = VIT, V. (21.48) 


?If R is symmetric and unitary, then R R* = R* R = I, and therefore the real and imag- 
inary parts of R commute. Thus they can be simultaneously diagonalized by an orthogonal 
transformation. The reader should be able to take it from there. 
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21.4 Pseudo-real representations 


The pseudo-real representations, in contrast, cannot be transformed into purely 
imaginary matrices, even though they are equivalent to their complex conju- 
gates. The simplest example of a pseudo-real representation is the spin 1/2 
representation of SU(2) generated by the Pauli matrices, T; = c4/2. Ob- 
viously, there is no way to generate this representation with antisymmetric 
imaginary 2 x 2 matrices, because there is only one such matrix and there 
are three generators. But still, the representation is equivalent to its complex 
conjugate because 

Oa = —02 al 02 (21.49) 


And indeed, the transformation matrix, c2, is antisymmetric, so the represen- 
tation is pseudo-real. 


21.5 Risaninvariant tensor 


There is another very useful way of thinking about the matrix R. It is an 
invariant tensor. To see this, note that if the representation is unitary, as we 
always assume, we can rewrite (21.32) as 


Ta, R= -RTI > T, R+ RTI =0. (21.50) 
In terms of explicit indices, this can be written 
[Ta]; R^ + (T, R^" =0 (21.51) 


which implies that R is an invariant tensor in the tensor product space D & D 
where D is generated by the Tọ. For any real representation, D & D contains 
the trivial representation, 1, only once because R is unique. If D is real- 
positive, the coefficient of the representation 1 in D & D, which is precisely 
the invariant tensor R, is symmetric in the exchange of the two equivalent 
Ds. If D is real-negative (pseudo-real), the coefficient R is antisymmetric. 


21.6 The explicit form for R 


To find the matrix R for the spinor representation of SO(2n + 1), it is enough 
to satisfy (21.32) for the generators 


1 ; ; 
Moj-1,2n41 = PEE ivre um re 
A ed Pct (21.52) 

2j2n41 = 9 93 3 92 
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for all j, because the rest can be obtained by commutation of these, and if 
(21.32) is satisfied for the generators (21.52), it is satisfied for any commu- 
tator of two of them. We can now build up R as a tensor product of 2 x 2 
matrices in the various subspaces, 


R=[[ (21.53) 
j 
We must have 
1,1 i-l ..1 
=y 012 T = 012 
-1 * 9-1 
Cuba rtl rg? 2 1.2 


(21.54) 


! must anticommute with c1 


Now we can determine the rJ one at a time. r 

and commute with c. Thus we can take r! = cl. Then from the next 
condition, r? must commute with c? and anticommute with c2. Thus we can 
take r? = o2. And so on — the ozs and os alternate. 


Thus we can take the matrix R to be 


R- [[ «i [[ «t Q1.55) 
Er Nit 
Note that 
R=1, o R=R™ and | RF = (-1)"™* D2 R (21.56) 


So for n = 1 and 2, R is antisymmetric; for n = 3 and 4, R is symmetric; 
for n = 5 and 6, R is antisymmetric; etc. Thus 


Algebra Spinors 
SO(8k + 3) pseudo-real 
SO(8k + 5) pseudo-real (21.57) 
SO(8k + 7) real 
SO (8k + 1) real 
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Problems 


21.A. Show that the set of 10 matrices, 38, in, ET and 3m gener- 
ate the spinor representation of SO(5). Find the matrix R = R~? such that 
T, = —R T% R for this representation. 


21.B. Identify any convenient SO(2n — 1) subalgebra of SO(2n + 1) 
and determine how the spinor representation of SO(2n + 1) transforms under 
the subalgebra. 


Chapter 22 


SO(2n + 2) Spinors 


We can use the results for SO(2n + 1) to work out the spinor representations 
for SO(2n +2). We will use the same notation as in the previous discussions. 


22.1 Fundamental weights of SO(2n + 2) 


The Dynkin diagram for SO(2n + 2) is 


OO- T Q2.1) 
" anti 


a a 
where 
a = e — ejt! forj=lton, at! = e” +e”t! (22.2) 
All the roots have the form 
tei +e, jk (22.3) 


We are interested in the two special representations corresponding to the 
last two fundamental weights, 


(22.4) 


Call the corresponding representations D” and D"*1. The weights of D" are 
of the form 


1 : 
Y mya (22.5) 


265 DOI: 10.1201/9780429499210-23 


266 CHAPTER 22. SO(2N 4- 2) SPINORS 


where 
n+l 


"j= +1 and I] 52-1 (22.6) 
j= 


This is because the roots always change the sign of two of the 7s. The weights 
of D"+! have the same form but with 


n+l 
Il y=: (22.7) 
ju 


Now that we know the weights, we can determine the reality properties 
of the representations. If n is odd, so that the algebra has the form SO(4m) 
for integer m, then there are an even number of 7s. Thus —j” is a weight 
in D^, because it still satisfies [7*] m; = —1. Evidently, it is the lowest 
weight in D". Similarly, —u”+! is the lowest weight in D”+!, Thus, for n 
odd, the spinor representations of SO(2n + 2) are real (or pseudo-real). 

But for even n, —” is the lowest weight in D"+!, because it satisfies 
I rj = 1. Likewise, —u”*? is the lowest weight of D”. Thus for even 


n, the spinor representations are complex. D” is D"*! and D" is pn, 
To construct these representations in detail, consider the SO(2n + 1) 
subgroup of SO(2n + 2) generated by 


Mj, for j,k <2n+1 (22.8) 
This eliminates the Cartan generator 
Hy41 = Mon+1,2n+2 (22.9) 
The 4n generators with weights 
ce ce for Gen (22.10) 
which are linear combinations of 
Mion4i and Mjonyo for j <2n (22.11) 
collapse to the 2n generators M; 2n+1 with weights 
te) for j<n (09.12) 


Under this SO(2n+1) subgroup both D" and D”*? transform like the spinor 
representation we just analyzed. Because Hn+1 is not a generator of this 
subgroup, we can take over the previous analysis without change if we just 
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ignore the n + 1 component of all the weight vectors. When we do this, the 
weight vectors are the same for both D” and D^*!: 


n 
Sine? for =t] (22.13) 


In the tensor product notation, 7; = ci and the generators are given by 


3 

1 E j 

gt abot 
eb 5 


M»j—12n41 (22.14) 


1 
Moaj2n+1 = 573 

Now going back to the full SO(2n + 2) algebra, in the representation 
D”, we know that 


m+ = — [[ m= 79i: 9$ (22.15) 
j=l 

Therefore i 

Hn+1 Mon41i2n42 = E gii. gh (22.16) 
Now all the generators can by found using the commutation relations. 
In D^*1 

Thai = I nj = 93° (22.17) 

and ; 
Fins) = Mon+1,2n+2 = ; 73 -03 (22.18) 


Now that we have explicit forms for the representations, we can examine 
the reality properties we discussed earlier in more detail. As in SO(2n + 1), 
we define the matrix 


RR Iie TI of (22.19) 


odd even 


and then look at the complex conjugate representations 
=RT R! (22.20) 
for D" and D"+1. We already know that 
-R M}, R = My, (22.21) 
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for the SO(2n + 1) subgroup, j,k < 2n + 1. It is then enough to see what 
happen to Mon+41,2n+2 under complex conjugation. If n is odd, 


—RM3,i12n42 Ro) = Man+1,2n42 (22.22) 


and then the equivalence can be established for all the generators using the 
commutation relations. Thus, as we already knew, the representation is real. 
But now we know R, so we can determine whether it is real-positive or 
pseudo-real. For n = 4k + 1, R is antisymmetric, so D” and D”+! are 
pseudo-real. For n = 4k + 3, R is symmetric, so D" and D”+! are real. 

For even n, 


-R Mžn+1,2n4+2 R! = —Mon+1,2n+2 (22.23) 


so the representations D” and D"*! get interchanged and thus they are com- 
plex. 

The full story of the reality properties of the SO(N) spinors is then as 
follows: 


Algebra Spinors 
SO(8k +3 pseudo-real 
SO(8k + 4 pseudo-real 
SO(8k +5 pseudo-real 
SO(8k +6) complex (22.24) 
SO(8k +7) real 
SO(8k) real 
SO(8k + 1) real 
SO(8k + 2) complex 


Problems 


22.A. Show that SO(2n) has a regular maximal subalgebra, SO(2m) x 
SO(2n—2m). How do the spinor representations of SO(2n) transform under 
the subgroup? 


22.B. Show that SO(2n--1) has a regular maximal subalgebra, SO(2m) x 
SO(2n — 2m +1). How do the spinor representations of SO(2n + 1) trans- 
form under the subgroup? 


22.C. Show that SO(4) has the same algebra as SU (2) x SU(2). Thus, 
it is not simple. Nevertheless, the arguments in the chapter apply. Explain 
how. 


22.1. FUNDAMENTAL WEIGHTS OF SO(2N 4- 2) 269 


22.D. Show that the SO(6) algebra and the SU (4) algebra are equiva- 
lent, with the 4 of SU (4) corresponding to a spinor representation of SO(6). 
In SU (4), 48 4 = 66 10. The 6 is the vector representation of SO(6). What 
is the 10, in the SO(6) language? 


Chapter 23 


SU (n) C SO(n) 


In order to discuss the embedding of SU (n) in SO(2n) efficiently, we will 
introduce another language for talking about the spinor representations — 
Clifford algebras. 


23.1 Clifford algebras 


A Clifford algebra is a set of operators satisfying the following anticommu- 
tation relations: 


D;,I,)b-2ó6;, forj,k=1toNn (23.1) 
j j 


If you are given such a set of operators, you can immediately construct a 
representation of SO(N) as follows: 


Mj = Z E5 Ta (232) 
4 
It is easy to check, using the Jacobi identity, that the M;, have the commuta- 
tion relations of SO(N), and furthermore that the T's are a set of tensor oper- 
ators transforming according to the N dimensional representation of SO(N), 
satisfying 


[Mik Tj] = i (50 Tk — dpe Y =Tn [MR lme (23.3) 


where , 
[Mi Jem =-1 (530 Skm — Ojm m) (23.4) 


generate the vector representation, D!, with highest weight Ll. 
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For SO(2n + 1) it is straightforward to find a Clifford algebra that gives 
the spinor representation we constructed. It is 


Prssolo? -03 T2 -—olo2-..02 
Ty = 02.03 Ty = —o?---o? 
(23.5) 
Pon-1 = 05 Ton = —OT 
Tanti = clo “OR 
Notice that the product 
Tiro: Tony = 2” (23.6) 


is proportional to the identity element. 

There is a connection of this with SO(2n+2), even though we don’t have 
enough elements in the Clifford algebra to generate it. These elements of the 
Clifford algebra are actually proportional to M; 2n+2 in the spinor represen- 
tation that we constructed. These generators transform like the components 
of a vector under the SO(2n + 1) subgroup of SO(2n + 2). 

While we cannot construct a representation of the SO(2n + 2) algebra 
from this Clifford algebra (at least, we cannot use (23.2) to construct the gen- 
erators because we do not have enough T's), we can construct the SO(2n) al- 
gebra just by leaving out ['2,41. But the resulting representation is reducible, 
because there is a nontrivial matrix that commutes with all of the generators 
— namely [2,41 itself — 


T 
T2541 = (7i) T3T2 Ts = [[ $3 (23.7) 
j=l 


We can use this to construct projection operators onto the two invariant sub- 
spaces that transform according to the irreducible representations, D”~! and 
D”. 


1 

= (1 -— Tons) projects onto Des 
A (23.8) 
5 (1 + Pont) projects onto D?” 


because [Jj 2 is —1 on the first subspace and +1 on the second. 


272 CHAPTER 23. SU(N) c SO(2N) 


23.2 Tn and R as invariant tensors 


The main reason that the Clifford algebra construction is useful is that I's are 
a set of invariant tensors (see (23.3). For that reason, we can use it to help 
with the process of Clebsch-Gordan decomposition of products of spinors. 
Let us return to SO(2n + 1) and write out the commutation relation (23.3) in 
terms of components — it looks like 


[M jk]z [Ue]zy = [M5 yz [Ud]: 


: (23.9) 

+M? Jem [Um]zy =0 
Thus [I°g],, is an invariant tensor in which the x index transforms like the 
spinor representation, D”, the y index like the conjugate representation, D 
and the Z like the vector representation, D!. Of course, we know that in 
SO(2n + 1) the representation D” is equivalent to D" by a similarity trans- 
formation involving the matrix R, but in this form for the I's, we have not 
done the similarity transformation, it is the D” matrices that appear explic- 
itly. We could put in the transformation explicitly. Suppose we do that using 


—Mj, = = —M}, = R! My, R (23.10) 


It is useful to write this in terms of upper and lower indices. If the matrix 
elements of the generators of D” (say) are written as 


[Mik]; (23.11) 


where z refers to the row (the first index of the matrix) and y to the column 
(the second index), then R behaves as an invariant tensor with two upper 
indices. We showed this in chapter 21, but let's do it again. If we multiply on 
the left by R and put the indices in, (23.10) becomes 


—R* [Mj]? = [Mjr]; R” (23.12) 
or in prettier form 
[Mjx]z R” + [Mj]? R = 0 (23.13) 
which is the statement that R7” is an invariant tensor 
The elements of the Clifford algebra have one upper and one lower index, 
like the M;;s 
DE (23.14) 
These do not have a definite symmetry property (or more precisely, I; has 
different symmetry for different values of 7 and these properties depend on 
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the basis) because the upper and lower indices are different kinds of objects 
so it doesn't make any sense to interchange them. But by multiplying by R, 
we raise the lower index 


(Cr; RJ" = [D;]£ R” (23.15) 


This object can and does have definite symmetry. 
Now let us rewrite the commutation relation of M;, with Ie with upper 

and lower indices 
[Mya]? [Fa]; + [R^ Myx RE Pelz 


(23.16) 
[M lem Emly = 0 


which we can rewrite by multiplying by R7" and summing over y 


[Myx]? (Pe RJ" + [RT R^! Mj RI? (Te) 
ME, lem (Pm RJ"" 
= [Myx]? [Pe RJ" + [RT R^! Mj] [Pe RT" 
MEE lem [Um RI?" 
= [Mj]z [Pe R]" + Mj]? [Pc R] 
*[MZ, lem [Pm RJ" = 0 


(23.17) 


In the last step we have used the fact that R is either symmetric or antisym- 
metric, so I can replace the two RTs by two Rs. Thus it is the product D; R 
that is an invariant tensor in the space D” & D" @ D!. These products must 
be either symmetric or antisymmetric in exchange of the identical D" labels, 
unlike the I's themselves. This will seem familiar to those of you who have 
studied the Dirac equation, where something similar happens with the Dirac 
"y matrices. 

To see whether I’; R is symmetric or antisymmetric, we can check any 
Dj. The easiest is 2,4, which is just a product of oss in all the spaces. 
When we multiply by R, the result is a kind of mirror image of R, where, up 
to factors of i, the os and os are simply interchanged. If we define 


RT —-ggR and — (TR) =nrrITjR (23.18) 


then 
"R:T"TR —(-1)* (23.19) 
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because in each two dimensional subspace, there isa c; in either R or Fon+1 R 
and a c in the other. Since we know that 


ng = (-1)^*02 (23.20) 


we find that 
npr = (71) 0-9 (23.21) 


23.3 Products of I's 


The reason that it is sometimes better to stick to the Fs rather than using the 
symmetric I’ Rs, is that matrix products of the Is also yield invariant tensors. 
These products have one upper and one lower index, so like I’; they live in 
the space D" & D”, and we can use them to complete the process of Clebsch- 
Gordan decomposition. For example, consider the commutator, [T ;, T4]. We 
already know that this is proportional to M;;, which is an antisymmetric 
tensor operator. But we could also conclude this using the Jacobi identity: 


[Mjk, Te, Pml] = [[Mjx, Ty] Dm] + Te, [M;jx, T] 


n MP MM (23.22) 
= Em, m] [Mik Jm + [ £5 m] [ jk Jmm 


which means that |I';, T+] transforms like the antisymmetric tensor product 
of two vectors. 

More generally the interesting combinations are the products that are an- 
tisymmetric in the vector indices, defined as a sum over permutations of the 
indices. For m < n, define 


1 
I' P; TD; ] = > ADE, Tk tt Diem (23.23) 


with the — sign for the odd permutations. The other combinations are not 
interesting because the anticommutator of two I's can be eliminated using 
the Clifford algebra. The Jacobi identity can be used in exactly the same 
way to show that this transforms like an antisymmetric tensor with m vector 
indices. Of course, we don’t really have to do this complicated sum because 
the Clifford algebra guarantees that the I; anticommute for different values 
of j, so really all the sum is doing is ensuring that no two indices are the 
same. 

We do not get any new matrices form > n, because of the fact that the 
product of all 2n+1 Ts is proportional to the identity, so the antisymmetric 
product of n+x Is is related to the antisymmetric product of n —&--1 T's. 
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The product Ti; I5, - - I5 tells us explicitly how the antisymmetric jz 


index tensor appears in the tensor product D" & D”. The tensor product of 
two spinors in SO(2n + 1) can be decomposed into antisymmetric tensors of 
rank O (the trivial representation), 1 (the vector), 2 (the adjoint for n > 1), 
and so on up to rank n. You can see that the dimensions work out because of 
the binomial theorem: 


2n41 
gmH = (141) = Y G T ) 
k 
k=0 (23.24) 


2 1 
sE ug 


k=0 


where the second line follows because 


2n +1 2n +1 
= 23.2. 
Ce iu (23.25) 


so that the sum from k = 0 to n is the same as the sum from n+1 to 2n+1. 

Somewhat more bizarre is the breakup into symmetric versus antisym- 
metric products. All the products have one upper and one lower index, so 
again, to get something that is either symmetric or antisymmetric, we must 
multiply by R. The result of multiplying by R. the product of k distinct T4 
matrices (which is the antisymmetric product, because the different indices 
anticommute) can be written as 


Py Ty: DER 


zn E (23.26) 
= (Pj, R)R (Pj, R)R e (Tj, R) 


The transpose is 


(Dj, RR (TS R RM (LR) 
=n Mgr RR e (DQBR)R (DR) (23.27) 
=m Mri, TAT AR 

where we have used the fact that R^! (which has two lower indices) has the 


same symmetry properties as R itself. To get the I‘; factors back into the 
original order we have to make 


(k — 1) (k-2)+---+1=k(k—1)/2 (23.28) 
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transpositions, each of which gives a — sign because of the Clifford algebra. 
Thus we have finally 


(E ja um d 
(23.29) 
= (-1) peo: RMR Al ie: DR 
We can write the factor in front somewhat more elegantly as follows 


(51 1906-024 En 


(-1)/*7 Don (nnrrg) 
(::1)56- 1/26. yer) jak (23.30) 
-(- Lye 9g 1)70+1)/2(—1) 7% 

(1) 


1 (n—k)(n—k--1)/2 
Thus in SO(2n + 1), the symmetry properties of the k index antisymmetric 


tensor in the tensor product of D” & D" repeat with period 4, and look like 
this: 


(23.31) 


Thus for SO(3) the 1 is antisymmetric and D! is symmetric. For SO(5) 
the 1 and D! are antisymmetric while D? is symmetric. For SO(7) the D! 
and D? are antisymmetric while 1 and D? are symmetric. For SO(9) the D? 
and D? are antisymmetric while 1, D! and D* are symmetric. And so on! 

For SO(2n), the analysis is more complicated because the 2" dimen- 
sional space on which the Clifford algebra acts is not irreducible. We will use 
the projection operators onto the irreducible subspaces 


P= ;( + Pont) (23.32) 


P, projects onto D" while P_ projects onto D?-1. Now a general 2” x 2” 
transforms under commutation with the generators like the tensor product 


(D" e De (D e D) (23.33) 
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Then we can project out all four possible transformation laws with the pro- 
jection operators. If K is an arbitrary 2" x 2” matrix, then: 


P* K P+ transforms like D” @ D” 
P- K P~ transforms like D"-!@D"' 
TQn-1 


P% K P~ transforms like D” & D 
P- K P* transforms like D”! & D” 


(23.34) 


As before, we can construct the K matrices out of antisymmetric products 
of the Ts. We need not include I'o4,,, because we are interested only in 
SO(2n) (and the projection operators make it +1 anyway). Furthermore, 
once we leave out ['2,,41, the projection operators pick out either the even or 
odd products, because the other I's anticommute with I5, 1, and therefore 


D;P,—P.TI; forj=1to2n (23.35) 


Thus only an odd number of I's can appear between P, and P_ (in either 
order) and only an even number of I's can appear between two P, or two 
P... As before, we can ignore tensors with rank m > n. 


23.4 Self-duality 


There is an additional subtlety that occurs for tensors of rank n. Not all the 


components of the rank n antisymmetric tensor in D” & D" or D”! Q D 
are independent. For example in D" & D” 


PQ,I1I2:o Ty Ph = P4Iq4T5: Cn Tongs Ph 
za PITE estas 


i (23.36) 
= (-4)* (—1)99-UP P, Payı Png- Dos Ph 


ane l 
= (—i)” n! €12.-n 51-55 P, Pi AEF Pijn P. 

Evidently, the rank n tensors satisfy a self-duality condition, the nature 
of which depends on n. If n is even, the relation is real, and the tensor is 


either self-dual (for D" & D”), satisfying 


1 


Ag sides = $i Chin kick PN (23.37) 
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or anti-self-dual (for D^-! & D" '), satisfying 


1 
Ar, = Tal Eji dn ki kn Akika (23.38) 


If n is odd, on the other hand, the relation is complex: 


Aissa = +1 €ji- ja ki kn Ak, «kn (23.39) 
and thus the representations are complex. It is through this complex self- 
duality condition that the complexity of the SO(4n + 2) spinor representa- 
tions is manifested in the ordinary tensor representations. 

We sumniarize all of this below, incorporating the fact that for SO(2n) 
with even n, D” = D" and D"! = D?-, while for odd n, D" = D"^-! 
and D"! = D^. We indicate the rank m antisymmetric tensor representa- 
tion by (m). 


n 
SO(2n +1): D" & D” = V (k) 
k=0 
m-—1 
SO(Am) : D?" & D?n-! = V" (2k + 1) 
k=0 
m—1 
D?" @ D™ = Ñ` (2k) + (2m). 
k=0 
m-1 
D?n-! g p?m-! = V^ (2k) + (2m). 
2 " (23.40) 
SO(4m +2): D?" @ p?m = S^ (2k) 
m-—1 rov 
p? @ DM = S7 (2k +1) + (2m - 1) 
"T k=0 
D?" @D'™ = S (2k +1) + (2m 4 1) 
k=0 


where (2m)* are self-dual and anti-self-dual respectively, and (2m+1), and 


(2m+1)2 satisfy complex self-duality conditions. 
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23.5 Example: SO(10) 


For example, consider the 16 dimensional D? representation of SO(10). We 
see that 
16 6 16 = (1) e (3) & (5i (23.41) 


The (1) is the 10 dimensional vector. The (3) is the 10 - 9-8 /6 — 120 dimen- 
sional three index antisymmetric tensor. The (5) is the 5 index antisymmetric 
tensor with a complex self-duality condition, with dimension 
110-9-8-7-6 
2 E.A. — 126 (23.42) 
The symmetry of these under exchange of the identical 16's can be found by 
looking at the discussion of the symmetry of the D; Rs for SU (2n + 1) — in 
this case we want n — 5 to focus on the 2? dimensional space of the Clifford 
algebra. The symmetry factor for (k) is (—1)(79(n—k-0/2. so the (1) and 
(5) are symmetric and the (3) is antisymmetric. 
Likewise 


16 @ 16 = (0) @ (2) @ (4) 210459210 (23.43) 


23.6 The SU(n) subalgebra 


Now we can identify an SU(n) subgroup of SO(2n). From the Clifford 
algebra, we can construct the objects 


1 
A; = ={To;-1 ~ iI; 
č 2 (rs i (23.44) 
Al = 5 (Tui +i T2) 
Because of the Clifford algebra, these satisfy 
UAE T {4}, Ab zi (23.45) 


{Aj AL} = dj 


This is, therefore, a set of creation and annihilation operators, from which we 
already know how to construct an SU (n) algebra. Using the matrix elements, 
(Taljx, of the defining representation of SU (n), we define 


Joss ANT Ax (23.46) 
j,k 
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We know that the T, generate SU (n) on the 2” dimensional space of the 
Clifford algebra, but we must also show that it is a subalgebra of SO(2n). To 
see this we can write 


Aj A i {A}, Ar} T E |A}, Ae] -5 p 


1 1 
*5M»3j-13k-1 + z M2j-1,2k = 5 M2j,2b-1 + 3 Majak 


The ó;j, term does not contribute to Tọ because [T,];,, is traceless. The other 
terms are SO(2n) generators. So, the T, are linear combinations of the 
S O(2n) generators, thus they generate a subalgebra. 

Denote by |0) the state that is annihilated by all the A;. In our representa- 
tion of the Clifford algebra, all the A;s are proportional to lowering operators, 

o) , so [0) is the state for which all the oj = —1. It is in the representation 

D^ for n even and D"-1 for n odd. 

As always, the creation operators, Al, are tensor operators: 


[Tas At] = 37 A} [Talki (23.48) 
k 


(23.47) 


They transform according to the n dimensional defining representation of 
SU(n). 

When we act on |0} with some number of Ats (say m), we therefore get 
a set of states transforming like the antisymmetric tensor product of m ns 
because the A's anticommute. This is the fundamental representation [m]. 
Otherwise, this construction is similar to what we did with the 3-dimensional 
harmonic oscillator. 

Note also that if m is even, the states formed by acting on |0} with m Ats 
are in the same irreducible representation of SO(2n) as the |0) state, because 
L'2441 anticommutes with each At, so it commutes with the product of an 
even number of them. But for odd m, the states are in the other irreducible 
SO(2n) representation. 

Putting this together, we can summarize the embedding of SU(n) in 
SO(2n) as follows. For SO(4n + 2), 

n n 
D= piri Dsg) (23.49) 
j=0 À j=0 


This is consistent with the fact that D2” = D?^*! because [j] = [2n+1— 7]. 
For SO (4n), 


n n—1 
p^-YX^pj Day Pii (23.50) 
j=0 i= 
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There is an SO(2n) generator which commutes with all of the generators 
of the SU (n) subgroup we have constructed. It is 


n n 
n 
S=} Majus; => AL A; — 5 (23.51) 
j=l j=l 
S generates a U(1) subalgebra. In the space of the Clifford Algebra 


jg. n: 
ego d (23.52) 


Thus S|0) = —$ |0). Then, since [s. Al] = Al, the creation operators raise 
S. Thus the representation [m] in the spinor representations of SO(2n) has 
S=m—n/2. 


Problems 
23.A. Check that the dimensions work out in (23.49) and (23.50) by 


using the binomial theorem. 


23.B. Use (23.9) and (23.44) to determine how the vector representation, 
D', of SO(2n) transforms under SU (n). Explain why this result is obvious. 


23.C. Let u* be a completely antisymmetric tensor in SO(6). A self- 
duality condition has the form 


ys ke - A c kfabc ute . 
What are the possible values of A? 


23.D. How do the spinor representations of SO(14) transform under the 
following subgroups: 


Chapter 24 


SO(10) 


SU (5) is the only choice for a unifying algebra that makes use only of the 
matter particles that we actually see. However, it is not obvious that this is a 
necessary property. We know that the unifying symmetry, whatever it is (and 
assuming that it exists at all!) must be spontaneously broken to account for 
the differences in the interactions that we see. It could be that the process of 
spontaneous breaking gives a large mass to some of the matter particles of a 
representation, thus effectively removing them. The next simplest unification, 
based on the algebra S'O(10), adds just one such matter particle - a right- 
handed neutrino. 


24.4 SO(10) and SU(4) x SU(2) x SU(2) 


Let us begin the discussion of SO(10) unification by considering (23.49) for 
n = 2. This shows that the SU(5) content of the spinor representations of 
S O(10) is as follows: 


D? = [1] + [3] + [5] 2 5 + 10+ 1, 


A (24.1) 
D* = [0] + [2] + [4] =1 +10 +5. 

Thus D? has the right SU (5) content to describe the right-handed creation 
operator of a family of quarks and leptons, with the addition of one SU (5) 
singlet. Likewise, D^ behaves like the left-handed creation operators plus a 
singlet. 

Thus SO(10) has the interesting feature that it incorporates all the cre- 
ation operators of a single family into a single irreducible representation. Of 
course, it also incorporates an extra singlet which does not correspond to any 
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of the particles that we have observed in the world. But we can hope that this 
extra particle will get a large mass in the spontaneous symmetry breaking 
process. 

The extra singlet is also related to the particles that we actually observe 
in an interesting way. When we include it, we can restore the parity symme- 
try that is lost in SU(5). To see what is happening, let us examine another 
subgroup of SO(10). 

The SU (5) subalgebra of SO(10) is regular, but not maximal. It is ob- 
tained by removing one of the circles from the Dynkin diagram. To find the 
regular maximal subalgebras, consider the extended Dynkin diagram: 


(24.2) 


If we remove the circle labeled with the x, the diagram falls apart into SU (2) x 
SU (2) x SU (4). The SU (2) x SU(2) is the same algebra as SO(4), and the 
SU (4) is the same as SO(6). Thus this is the subalgebra of the 10 dimen- 
sional rotation generators which are block diagonal on 4 and 6 dimensional 
subspaces. 

We can use the Dynkin diagrams rather directly to see how the spinor 
representations transform under the subgroup. The weights of D? are 


5 5 
Y ne, where [[w-1 (24.3) 
i=1 


i=l 


from (22.5)-(22.7). We know from the form of the extended Dynkin diagram 
that the SU (4) (or SO(6)) subgroup has roots a, a”, and a? (the lowest 
root). Explicitly 
ol =e! —e?, ao =e’? —e3, o? = -el.e?, (24.4) 
Thus the roots of the SU (4) subalgebra are entirely in the three dimensional 
subspace of the weight space formed by the first three components. The 
weights, (24.3), decompose into two copies of each of the two spinor rep- 
resentations of SO(6), one for 717273 = 1, the other for mnong = —1. 
The SU(2) x SU(2) (or SO(4)) subgroup corresponds to the roots 


o =e*- 3 a? — e* 4 e? (24.5) 
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in the two dimensional subspace formed by the last two components of the 
weight space. a* generates an SU (2) under which the weights 


+(e* — e?) (24.6) 
form a doublet. o? generates an SU (2) under which the weights 

+(e! + eS) (24.7) 
form a doublet. Thus, the weights 


3 


$^ (ne) /24(e+e°)/2 for Iw-1 (248) 


t=1 


are associated with a representation of the SO(6) x SO(4) subalgebra which 
transforms like a spinor under the SO(6) (or equivalently, the 4 of SU(4)), 
like a singlet under the SU(2) associated with o^ and like a doublet under 
the SU (2) associated with o? (we will call this SU (2)' to distinguish it from 
the first SU (2)). Thus, under the SU (4) x SU (2) x SU (2)' subgroup, (24.8) 
is a (4, 1, 2). 

The weights 


3 


Y (nd)/25(* -6)/2 for Iv = -1 (24.9) 


t=1 


transform like the complex conjugate spinor under the SO(6) (or equiva- 
lently, the F of SU(4)), like a doublet under the SU(2) associated with o* 
and like a singlet under the SU(2) associated with að. Thus, under the 
SU(4) x SU(2) x SU(2)' subgroup, (24.9) is a (4, 2, 1). 
Thus 
DË = (4,1,2) 6 (4, 2,1) (24.10) 


under SU (4) x SU(2) x SU(2)/. It then follows that the complex conjugate 
representation, D4, is 


D* + (4,1,2) @ (4,2, 1) (24.11) 


The SU (4) in SU(4) x SU(2) x SU(2) contains color SU(3). Each 4 
is a 3 @ 1 under SU(3). Each 4 isa 3 @ 1. Comparing (24.10) with (18.13), 
you can see in more detail how the SO(10) unification works. The weak 
interaction SU (2) in (18.13) must be identified with the SU (2) subgroup of 
SO(10). Under it, the creation operators for the right-handed antiquarks and 


24.2. * SPONTANEOUS BREAKING OF SO(10) 285 


the right-handed positron and antineutrino transform as doublets. Under the 
SU(2)' of SO(10), the creation operators for the right-handed quarks are a 
doublet, and there is another doublet comprising the creation operators for the 
right-handed electron and a neutral particle. This SU (2)' is a mirror image of 
the weak interaction SU(2). If we look at the creation operators for the left- 
handed fields, in the representation D^, the antiquarks will be a doublet under 
SU (2)', just as the right-handed antiquarks are a doublet under SU (2) in D?. 
Thus we should identify the neutral partner of the right-handed electron in 
the SU (2)' doublet as a right-handed neutrino, the looking-glass version of 
the left-handed neutrino. 

The SO(10) unification thus restores the mirror symmetry of the theory 
that was lost completely in SU(5). Of course, physics is not mirror sym- 
metric. Parity is violated. The right-handed neutrino has never been seen. If 
SO(10) unification is to describe the world, there must be some symmetry 
breaking that makes the right-handed neutrino and the particles associated 
with the SU (2)! generators (and indeed, all the other SO(10) generators ex- 
cept those of the SU(3) x SU(2) x U(1) subgroup) very heavy. 


24.0 * Spontaneous breaking of SO(10) 


The spontaneous breaking of SO(10) down to SU(3) x SU(2) x U(1) can 
be considered in several steps. First one can ask, how can we break SO(10) 
down to SU(5)? Then we can ask, what SO(10) representation contains 
the 24 of SU (5) that we use to break the SU (5) subgroup down further to 
SU (3) x SU(2) x U(1)? Finally, we can ask what SO(10) representation 
contains the 5 and 45 of SU (5) that we use to break the SU(3) x SU(2) x 
U (1) down to the U(1) of electric charge? 


24.3 * Breaking SO(10) > SU(5) 


The simplest representation that can break SO(10) to SU (5) is the 16 (or 16) 
This is evident from (24.1). Because the 16 has a component that transforms 
as a singlet under the SU(5) subgroup, if that component has a vacuum value, 
the SO(10) symmetry will be spontaneously broken down to SU (5). In the 
first discussion of SO(10), this representation was used. However, there is 
something slightly unattractive about it. It cannot be used (at least at the clas- 
sical level) to give a large mass to the right- handed neutrino. It is important 
to have some mechanism for getting rid of the right-handed neutrino. It has 
not been seen, and if it were present, it would cause problems for both particle 
physics and for cosmology. The original discussion of SO(10) assumed the 
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existence of an SO(10) singlet neutrino, in addition to the 16. Then the 16 
Higgs representation could be used to put the right-handed neutrino together 
with this extra singlet into a massive particle, thus eliminating the unseen 
right-handed neutrino from the low-energy particle spectrum. 

What is unattractive about this is that the addition of the extra singlet 
may be a step backwards. The SU (5) model already unified the right-handed 
particles in two irreducible representations. The SO(10) model without the 
extra singlet, with all the particles of the lightest family in a single irreducible 
representation, is perhaps more "unified," but with the extra singlet, it seems 
like an unnecessary complication of the original SU (5) model. Is it possible 
to eliminate the right-handed neutrino without adding a singlet? The answer 
is yes, with a slightly peculiar twist. One can treat the right-handed neutrino 
as its own antiparticle, and find a Higgs in the tensor product of two 16's that 
is responsible for its mass. The vacuum value of the Higgs representation 
must be an SU (5) singlet, as before, so that it can give mass to the SU(5) 
singlet right-handed neutrino. 

The tensor product of two 16s is 


10 @ 120 e 126 (24.12) 


where the 10 is the S'O(10) vector, the 120 is an antisymmetric three index 
tensor, and the 126 is a complex self-dual five index tensor. The 10 and 126 
appear symmetrically in the tensor product of the two 16s (because the two 
16s are the same representation, their tensor product can be classified accord- 
ing to representations of 55), while the 120 appear antisymmetrically. Then 
we can easily figure out the SU(5) content of each of these representations 
by doing the tensor product of D? in (24.1) with itself. The antisymmetric 
part is 


(5+10+1)@(5+10+1)as 
(5@1)@(10@1) (24.13) 


= (5 9 5)as @ (10 8 10) 4s 6 (5 @ 10) e 
=100450504€5610 


Note that this is a real representation. The symmetric part is 
(5+10+1)@(5+10+1)s 


= (5 9 5)s @ (10 @ 10)5 @ (1 8 1)s 6 (5 8 10) 6 (5@1) e (10 8 1) 
=1 


5650506105045500 
(24.14) 
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Since the 10 is 5 + 5, the 126 is 
156508184505010 (24.15) 


This representation is complex, and it contains the SU (5) singlet component 
that we want to give the right-handed neutrino a mass. This is particularly 
useful, because an SU (5) singlet vacuum value cannot give mass to any of 
the other particles, because their masses require a vacuum value that breaks 
SU(2) x U(1). Thus if the vacuum value of the SU(5) singlet component 
of the 126 is very large, it will give a large mass to the right-handed neu- 
trino, but all the other matter particles will remain massless until we turn on 
vacuum value for the Higgs that breaks SU(2) x U(1). This vacuum value 
can be much smaller (and indeed, as in SU(5) is has to be if the couplings 
are to come out correctly), which explains why we don't see the right-handed 
neutrino — it is much heavier than all the other fermions. 


Either with the 126 or the 16 and an extra singlet, we get rid of the right- 
handed neutrino and break the S'O(10) symmetry down to SU(5), we can 
then discuss the further breaking of SU (5) by asking what SO(10) represen- 
tations contain the S'U (5) representations that we discussed for the breaking 
of SU (5). 


24.4 * Breaking SO(10) > SU(3) x SU(2) x U(1) 


As in SU (5), the obvious representation to consider is the adjoint represen- 
tation, the 45 dimensional representation, D?, which is a real two-index an- 
tisymmetric tensor. Let us ask how this SO(10) representation transforms 
under SU (5). We can find this by noting that the antisymmetric tensor prod- 
uct of two 10s is a 45. Since the 10 is 5 @ 5, the 45 is 


45 + 24 --10 - 100-1 (24.16) 


This is encouraging. Because (24.16) contains an SU (5) 24, we might expect 
to find a possible vacuum value that breaks SO(10) to SU(3) x SU(2) x 
U (1). 

It is not quite that simple, however. The most general possible vacuum 
value for the antisymmetric tensor can be brought, by an SO(10) transforma- 
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tion into the following canonical form: 


Ü d. 0e 99 09- OO o a 
can Uo 26 O- 8 0. 39 Os de 9 

O° 407 ^U ecu 0 c Oe SOP Be 0 

0 0 -a 0 0 0 0 0 0 0 

Qc dc 4 e c0. Ge dE ur On ug 

ENERO S - M E M CH 

6 0 0 0 0 0 0 a 0 0 

6°20) QU. 0) Oe Hag 0 <0 

0o 0 0 0 0 0 0 0 0 æ 

o o o E: E DO. OP 20 sas 0 


The only way this can commute with SU(3) x SU(2) x U(1) is if two of 
the as, say a, and a» are equal, and the other three as are equal, so that the 
vacuum value has the form 


a, h i02 0 
( 0 a3 I3102 ) (24.18) 


where the first block is 4 x 4, written as a tensor product of two two- dimen- 
sional spaces, and the second is 6 x 6, written similarly as tensor product of 
two- and three-dimensional spaces. This commutes with matrices of the form 


Áz + 02 S2 0 
( 0 PU " (24.19) 


where the As are antisymmetric and the Ss are symmetric matrices in the 
appropriate spaces. The A» + o2 S5 generate an SU(2) x U(1), while the 
Aa + 23 S3 generate an SU (3) x U(1). There is an extra U (1) here that we 
do not want, but it is broken by the vacuum value of the 16 or 126. Thus 
together with the 16 or 126, the 45 can do the required symmetry breaking 
down to SU (3) x SU(2) x U(1). 

There are other options for the SO(10) breaking. We will just mention 
one. The symmetry can be broken by the vacuum value of a 54, a real trace- 
less, symmetric tensor. This is also simple to analyze, because like the 45, it 
can be brought into a simple canonical form by an SO(10) transformation. 
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In fact, this is even simpler — the 54 can be diagonalized 


a 0 0 0 0 00 00 0 
0 a 0 00 00 00 0 
0 0a 00 00 00 0 
0o 0 0 ay 0, 0-0, 0. 0 0 
0 0 0 0 a4 0 0 0 0 0 
0 0 0 0 0 a4 0 0 0 0 eee) 
o 0 0 0 90> 0: ag. 87 “Or 0 
0 0 0 0 0 0 0 a 0 0 
0 0 0 0 0 0 0 0 a 0 
0 00 0 0 0 0 0 O0 ar 
where 
10 
» dj 50 (24.21) 
j=l 


The only way this can commute with SU(3) x SU(2) x U(1) is if four 
ji the as are equal, so that there is an unbroken SO(4) that contains the 

SU(2) x U(1), and the other six as are equal, so that there is an unbroken 

SO(6) that contains SU(3). Thus a vacuum value for the 54 that preserves 

SU(3) x SU(2) x U(1) automatically preserves a larger symmetry, SO(6) x 
SO(4), which is in fact just the SU(2) x SU(2) x SU(4) subgroup that we 
discussed at the beginning of this section. But in fact, this is really all we need 
from the 54. The 16 or 126 vacuum value breaks the SU (2) x SU (2) x SU (4) 
the rest of the way down to SU(3) x SU(2) x U(1). 


24.5 * Breaking SO(10) + SU(3) x U(1) 


Finally, we can think about what S'O(10) representations can be responsible 
for breaking SU (2) x U(1) and giving mass to the quarks and leptons. We 
have seen that these must contain a 5 or 45 of SU(5) and they should be 
contained in the tensor product of two 16s, (24.12). You can see that all three 
of the representations in 16 x 16 can do the job. Each gives a somewhat 
different pattern of mass relations among the particles of different charges 
and colors. 


24.6 * Lepton number as a fourth color 


There is one more thing that is worth mentioning about the SU (2) x SU (2)' x 
SU(4) subgroup of SO(10). This algebra is interesting because it is the 
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smallest that contains the SU(3) x SU(2) x U(1) symmetry in a semi- 
simple aigebra. This is physically interesting, because it leads automatically 
to the quantization of electric charge. A simple algebra is not necessary for 
charge quantization, because it is only U (1) factors that are not constrained 
by commutation relations. As we mentioned above, the SU(2)' factor is a 
right-handed version of the electroweak SU(2), so that in this model, the 
left-handed nature of the electroweak interactions is picked out not by the al- 
gebra, but by the vacuum — by whatever spontaneously breaks SU (2)' much 
more strongly than SU (2). But the SU (4) is also quite interesting. One can 
think of it as treating lepton number as a fourth color. The neutrino is part 
of an SU (4) 4 with the three colors of u quarks, and the electron is part of 
an SU(4) 4 with the three colors of d quarks. It is only the spontaneous 
symmetry breaking, which leaves the color SU (3) subgroup unbroken, that 
distinguishes leptons from quarks.! 


Problems 

24.A. Show that the matrices 
1, 1, 1, 1l, l, l1, loz l., 1 
p D D pu^) aT !| 9 )| 9 |a 2D B 
99 9" 3» 2 Pl; 2 P2 91103 2 P3 2 7)01 3g” P2 


where G, 7, 7 and p are independent sets of Pauli matrices, generate a spinor 
representation of SO(10). Find an SU(2) x SU(2) x SU(4) subgroup in 
which one of the SU (2) factors is generated by the subset 7 (1 + p3)/4. 


24.B. What is the dimension of the S'O(10) representation with highest 
weight 2,5. How do you know? Hint: consider D? & D*. 


"This idea, along with the SU(4) x SU(2) x SU(2) model, was an important precursor 
to unification, discussed in J. Pati and A. Salam, Phys. Rev. D 10 (1974) 275. 


Chapter 25 


Automorphisms 


An automorphism A of a group G is a mapping of the group onto itself 
which preserves the group multiplication rule: 


A(9192) = A(g1) A(92) (25.1) 


For a Lie group, an automorphism of the group induces a mapping of the 
Lie algebra onto itself which preserves the commutation relations. Under an 
automorphism, the generators are mapped into linear combinations of gener- 
ators 


Ta — Aap Tb (25.2) 
such that 
[Ta, To] = i fasc Te (25.3) 
implies 
[Aaa To , Ap Ty] = $ fabe Acc! Te (25.4) 


25.1 Outer automorphisms 


Some automorphisms are trivial in the sense that the mapping they induce on 
the generators is an equivalence: 


Ag, Ty = RT, Ro (25.5) 
where R = e'9«7^ is a group element. This is called an inner automorphism. 
But some of the Lie algebras have non-trivial or outer automorphisms. 

For example, consider complex conjugation. If T, are the generators of 
some representation, the mapping 


Ta —T. (25.6) 
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is an automorphism in which generators corresponding to imaginary antisym- 
metric matrices are unchanged, while generators corresponding to real sym- 
metric matrices change sign (note that we can always choose the generators 
to be either antisymmetric or symmetric in the highest weight construction). 
Thus, an algebra can have complex representations only if it has some non- 
trivial automorphism. 

We can identify the nontrivial automorphisms by looking at the sym- 
metries of the Dynkin diagram. For example, consider SU (4) with Dynkin 


diagram 
O-O-O a5 


Since a! and o? appear symmetrically, the diagram with o! and o? inter- 
changed has all the same lengths and angles and thus generates the same 
algebra. So there is an automorphism of SU (4) in which the corresponding 
generators are interchanged, 


Ey +, Ex (25.8) 


along with all the other changes this induces in the explicit construction of 
the algebra from the diagram. This automorphism is nontrivial, since it inter- 
changes the fundamental representations D! and D?, which are inequivalent. 
In fact, since these representations are complex conjugates of one another, 
this is just the automorphism induced by complex conjugation, up to some 
trivial equivalence. All of the complex conjugation automorphisms are ob- 
tained in this way, associated with reflection symmetries of the Dynkin dia- 
gram. 

But not all reflection symmetries correspond to complex conjugation. For 
example, the SO(4n) groups have only real representations. Thus the reflec- 
tion symmetries of their Dynkin diagrams correspond to nontrivial automor- 
phisms that are not complex conjugations. The most interesting (and bizarre) 
example of this is the group SO(8), with Dynkin diagram 


1 (25.9) 


Here there is a separate automorphism for each permutation for at, o? and 


at. 
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25.2 Fun with SO(8) 


The roots of the SO(8) Dynkin diagram are 
o) =e — eit! forj=1to3, o*-e +e (25.10) 


The fundamental representations D!, D?, and D4, with highest weights jl, 
u3, and y+, are each 8 dimensional. Dl is the defining representation, an 
8-vector. D? and D^ are the two real spinor representations. The automor- 
phisms map these three representations into one another in all possible ways. 
First consider the mapping between D? and D^. This corresponds to the 
symmetry of the Dynkin diagram that interchanges o? and a4 with a! and 
a? held fixed. This is implemented by changing the sign of e^. That means 


H, — —Ha4 or M18 — — Mrs (25.11) 


and 
Ee +e4 o Epei -e4 or Mj — — Mj (25.12) 


This, of course, is what we found when we explicitly constructed these rep- 
resentations. M;y for j,k = 1 to 7 were the same for both representations, 
while 
Ld cloio$ for D? 

Hi =M -4 1? (25.13) 
2 030303 for D* 
The rest follow from commutation. Of course in this case, we got the re- 
sult just from the symmetry of the Dynkin diagram, without constructing the 
representation explicitly. 

Now consider the automorphism that interchanges a! and o? with o? 
and o* held fixed. This is rather weird, because it interchanges the vector 
representation D!, generated by the antisymmetric matrices with only two 
non-zero elements with the spinor representation D?, generated by products 
of Pauli matrices, whose square is proportional to the identity. The relevant 
mapping in terms of the weights is 


1 


1 


€ Eee Se 


e-e- (25.14) 


epee pet 
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Solving this for the e/s we find 
(25.15) 


so the Cartan generators get mapped in the same way 
Hı > = (Hi + Ho + Hs — Hs) 


( (25.16) 
H3 — z (Hh — H: + H3 + H4) 
( 


H3 —H, + Hz Hs + Ha) 


or 
1 
Mi; > P] Mio + M34 + Msg — Mrs) 


2 


1 (25.17) 
Mss > 3 Mi? — M34 + Msc + Mrs) 


( 
M34 > + (Mia + M34 — Msg + Mrs) 
( 


1 
Mrz > 3 (—Mi2 + M34 + Msg + Mrs) 


Note how cleverly the theory has solved the mapping problem. The anti- 
symmetric generators, M2;_1,2; of the vector representation get mapped into 
matrices of the form 


to 0 0 0 
1 0 to 0 0 
= 0 0 xay 0 (25.18) 
0 0 0 o 


whose square is 1/4. 

The way the rest of the mappings go is this. The other generators break 
up into 6 sets of 4, each of which mix up among themselves like the four 
Mi», M34, Msg and Mg that we just dealt with. 
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SO(8) has a very interesting subgroup — SU(2) x SU(2) x SU(2) x 
SU (2). It is easy to see that this is a maximal subalgebra. It is just SO(4) x 
SO(4). But the way the automorphism works on the different SU (2) factors 
is instructive. Let us look at the extended Dynkin diagram. It is 


(25.19) 


a? at 


where the lowest root, o? is —e! —e. Evidently, we get the SU (2) x SU (2) x 
SU (2) x SU (2) subalgebra by removing o7. The four SU (2)s are associated 
with the four mutually orthogonal roots, 
a = -el-e al =e! —e? 
(25.20) 
a? =e — 4 at - e? et 


Now consider the action of this subalgebra on the spinor representation D?, 
with weights 
me/2 fo — [[m--1 (25.21) 
j 


These break up into two sets transforming irreducibly under the subalgebra. 
The set 


;C +e? +e- e) 3 z(e +e- e+ e*) ; usns 
;be cere ee) d Se ee) 


is orthogonal to a! and a+, and thus transform trivially, like singlets under 
the two SU (2) associated with a! and o. But each weight is a component 
of doublet under the SU (2) associated with o? and o?. Similarly, the set 

1 1 

;(e- dàn), She- e-e- e), 

: "P 2 (25.23) 
L[—el 4 Tf. 2. — 53 4 
3 (Ce +e eee), ; Ce +e e e*) 


is orthogonal to o£ and a, and thus transform trivially, under these two 
SU (2), and are components of doublet under the SU (2) associated with a! 
and o^. Thus we say that under the 


SU(2)? x SU(2)! x SU(2? x SU (2) (25.24) 
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subgroup, the spinor representation D? transforms as 
(2,1,2,1) & (1,2, 1,2) (25.25) 


There are two other similar possibilities for the tranformation property of 
an 8: 


(2,2, l, 1) e (1, 1,2, 2) 
and (25.26) 
(2,1,1,2) 6 (1,2,2,1) 


These correspond to D! and D^ respectively. How do we know which is 
which? 


Problems 


25.A. Carry through the argument discussed at the end of the chapter 
and determine which representation in the last equation in the chapter is D! 
and which is D^. 


25.B. Does SO(8) have an SO(5) subgroup under which one spinor 
(D£, say) transforms like two SO(5) spinors while the other spinor (D?) 
transforms like an SO(5) vector and three singlets? Explain. 


Chapter 26 


Sp(2n) 


One reason that I want to discuss Sp(2n) in more detail is that it is a good 
excuse to introduce another notation for SU (n) which is often easier to work 
with. 


26.1 Weights of SU (n) 


The weights of the defining representation of SU (n), v! for i = 1 to n, have 
the following properties 


l 1 1 Aj 
j=l 


The condition on the sum follows from the tracelessness of the generators. 
The n v? span an n—1 dimensional space, but we can make things much 
more obvious by embedding the 77's in an n dimensional space, as follows: 


l;. 
d fu 
vi = (e X /n) (26.2) 
where the e are an orthonormal basis and the vector X is 
TL 
EDS. (26.3) 
k=1 
In this notation, the simple roots of SU (n) take the form 


C z a) , forj=lton—-1 (26.4) 


oj = pÍ — pitt = 
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These are also the first n—1 simple roots of Sp(2n). The last simple root is 


a” = 2v” + "B gu (26.5) 


where v™+! is a unit vector orthogonal to ef for j = 1 to n. Now the point 
is that we can take 


1 
ntl. 
V = Jn L (26.6) 
Then 
a” = y2e^ (26.7) 


We could have guessed this from the Dynkin diagram by comparing with 
our form for the roots of SO(2n + 1). The only difference here is overall 
normalization of the roots and that the last root is longer than the others, 
rather than shorter. But you should now see the connection with the previous, 
somewhat cumbersome notation. 

So the roots are 


+e +e ; 
== for j#k and +V2e! 26.8 
A iF (26.8) 
or equivalently 
+e) + ef 
Fe S® forall j,k (26.9) 
V2 


The weights of the 2n dimensional representation are 
te) /V2 (26.10) 


so D! has highest weight 
pl =e! /V2 (26.11) 


It is easy to see that the other fundamental representations have highest 
weight 
j 
ul 2 V eva (26.12) 


k=1 


which is related to the antisymmetric tensor product of j D's. 
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26.2 Tensors for Sp(2n) 


If we invent a tensor language in which the states of D! have a lower index 
(as in SU (n)), the tensor coefficients have upper indices and transform in the 
usual way 

(T, u)* = (7,]$ v? (26.13) 
where o and Ó run from 1 to 2n. It is most convenient to write these in 
the tensor product notation we introduced in which a = (j, £) where j runs 


from 1 to n and z from 1 to 2. Then T, are the generators of the defining 
representation, with the form 


ca [Talks = (Si oF + AL og) (26.14) 


An arbitrary linear combination of Sp(2n) generators has this form where 
the S are three independent real symmetric n x n matrices and A is an an- 
tisymmetric n x n matrix. For example we could denote the generators as 
follows: 


1 
Tuji? = = ójkóje[ou]sy for u = 1 to 3 


& = S 
Tusly = ; (Gió + 5jn5%e) [uley 
foru —1to3,i Æj 
Tolis = ; (4:1 5;e — ju) Oxy 


The complex conjugate representation, with a lower index transforms as 


(26.15) 


(Ta u)a = [Taf ug = - (778 up (26.16) 
But this representation is pseudoreal, because 
-[RI** (TE [R^ le = Ur] (26.17) 


where 
RITKY = $5. [olay (26.18) 


R is antisymmetric, so the representation is pseudoreal. As we saw with 
the orthogonal groups, R is an invariant tensor with two upper indices and 
R^! is an invariant tensor with two lower indices (they happen to have equal 
matrix elements in this case). Thus we need never consider tensors with lower 
indices, because we can always raise the lower indices and obtain tensors with 
the same information. 
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Now we can use this tensor notation to analyze the irreducible represen- 
tations as tensor products of the defining representation. The point is that 
tensor analysis looks very much like that in SU (2n), except that the exis- 
tence of the invariant tensors R and R~! allows us to get rid of lower indices 
and to reduce antisymmetric combinations. For example, consider the tensor 
product D! & D!, which explicitly looks like 


u? v? (26.19) 

The symmetric combination. 
go? = utuh + uf vo (26.20) 
is irreducible because we cannot use R to decompose it further. This has 


highest weight 2v!, so this is that adjoint representation. Note that the number 
of components is right: 


2n(2n + 1) 23x met) n(n — 1) (26.21) 
2 2 2 
But the antisymmetric combination 
ur vP — uf v? (26.22) 
can be reduced, because the combination 
O = [R^ }] ga u* v? (26.23) 
is a non-zero singlet. The combination 
AY = utuh — uf or — lins [Rory u P (26.24) 
n 
satisfies 
[R] ga A9? — 0 (26.25) 
because 
[R Ja RO? = T(R R) = 2n (26.26) 


Therefore A% is the irreducible representation D? and we can write the prod- 
uct in terms of irreducible combination as 


ut oP = 5 5° 4 = AAP + LRO (26.27) 
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It is sometimes useful to see all this explicitly in the tensor product no- 
tation. Here the discussion should remind vou of our treatment of symmetry 
and antisymmetry in the spinor representations of the orthogonal groups. The 
point is that the matrix generators of D! provide an invariant tensor that de- 
scribes how the adjoint representation appears in D! & D!. However, this 
tensor has one upper and one lower index. To find the invariant tensor in 
D! & D!, we must raise the lower index with R. That is, if we multiply the 
generators on the right by R which in the tensor product notation is just a2, 
we should get symmetric matrices with two upper indices. And this is right, 
because 


S.à-- Ajo (26.28) 
( ) 


is symmetric because in each term the factors in the two spaces have the 
same symmetry. In the same notation, one sees that c» itself is the singlet in 
D! & D}, and that D? has the form 


(& T 8") o (26.29) 


where S” is constrained to be traceless. 
Problems 


26.A. Find a set of Cartan generators for c - S 4- A that makes it obvious 
that this defining representation is the fundamental representation ji! . 


26.B. Find D! & D? in Sp(6) using tensor methods. Find the dimensions 
of each of the irreducible representations that appear in the tensor product. 


Chapter 27 


Odds and Ends 


27.1 Exceptional algebras and octonians 


The search for interesting unified theories, like the SU(5) and SO(10) the- 
ories, leads naturally to an interesting theory based on the algebra, Eg, one 
of the exceptional Lie algebras. The exceptional algebras are associated with 
the octonians, a peculiar set of objects of the form 


a+ baia (27.1) 


where a and the b, for a = 1 to 7 are real numbers. The 7, are a set of seven 
“imaginary units" which are generalizations of ? and of the quaternions, 7. 
The tg have the following multiplication law: 


taig = ~deg + gag, ia (27.2) 


where gagy is completely antisymmetric. In some basis, g is 


9123 = 9247 = 9451 = 9562 = 9634 = 9375 = 9716 = 1 (27.3) 


with all other components either zero or obtainable from (27.3) by antisym- 
metrization. This multiplication law can be obtained from the following pic- 
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ture: 
1 ~ 
7 S RC 
/ MN 
G z 222 
/ -7 
/ PE 
4 Pd 
4 qu 
PM S (27.4) 
^47 
3 4 
6 5 


The seven sets {jk£} for which gjx¢ is not zero are obtained by rotating the 
doted triangle, with the cyclic order of the indices maintained. 

The algebra (27.2) shares with the real numbers, the complex numbers 
and the quaternions a nice property that was discussed in chapter 19. It is a di- 
vision algebra with an absolute value that obeys a product rule. The absolute 
value, 


la + boie| = (a? n t) 3s (27.5) 
is preserved under multiplication. If A and B are octonians, then 
|AB| = |A||BI. (27.6) 
However, the octonian multiplication law is not associative! For example, 
(i149)t7 =igt7 =t5, 41 (tt7) = 2 (—t4) = —is. (27.7) 


Because of the lack of associativity, octonionic matrices do not generally 
form groups (whose multiplication laws must be associative). However, the 
octonians are connected with the exceptional algebras in less direct ways. For 
example, G is the subgroup of SO(7) that leaves the object gagy appearing 
in (27.2) invariant. 
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27.2 Eg unification 


Now back to Eg. Note first that to go from Eg — E7 — Es, you remove one 
circle from the left branch of the Dynkin diagram, as shown below. 


Es = SO(10) or 


Continuing the same series, you see that E; = SO(10) and Ey = SU(5). 
Because E4 and E; both give sensible unified theories, Eg is worth looking 
at. Of course, this argument is only suggestive, because it turns out that Ez 
and Es are not useful for unification. But Eg does turn out to be interesting. 

Let us begin by considering the root system and some of the fundamental 
representations. Label the roots as shown below: 


@ 
O@-O-O-Q 


The Cartan matrix is then 


(27.10) 
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From this we can construct the weights of the representation D! using the 
Dynkin coefficients. The result (without all the lines — there were too many 
to draw) looks like this: 


ELT 


(27.11) 


There are 27 weights here. This looks a little complicated, but one thing 
is clear. This is a complex representation— it is not symmetrical about the 
origin in weight space. To understand the structure of this object in terms of 
things we are more familiar with, let us see how this representation transforms 
under the two maximal regular subalgebras of Eg; SU(3) x SU(3) x SU(3) 
and SU(6) x SU(2). We can do this by considering the extended Dynkin 
diagram and finding the Dynkin coefficients of the 27 under the subalgebras. 
The extended Dynkin diagram is given by 


(at) 
(at) 
o Cm 
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It follows from (20.15)-(a) that 
oO +a! + 2a? + 3o? + 2at +a + 2a® — 0 (27.13) 


Thus we can compute the Dynkin coefficient of the weight u for the root o£, 
n. i 
p= MER (27.14) 
as linear combinations of the 6 coefficients: 


f = -L — 2 — 3B — 2t — qm. 26° (27.15) 


The result is shown below where the last column is 2°. 


(27.16) 


If we now remove one of the roots to get a regular maximal subalgebra, 
we can simply remove the Dynkin coefficients corresponding to the removed 
weight from (27.16), and what remains will be the Dynkin coefficients of 
the subalgebra. Below, we show the result of removing the root o? to get 
SU(3) x SU(3) x SU(3). We also write the remaining coefficients in the 


27.2. Eg UNIFICATION 307 


order 120654. We do this so that in each of the three SU(3) factors, the 


two roots appear together and the root on the outside of the extended Dynkin 
diagram appears first. 


(27.17) 


These can be organized into three sets of nine weights: 


[ {10,-11,0-1}{01,1-1,-10}00 |, [{01,1-1,-10}00{10,-11,0-1} |, ae 
and | 00{10-11,0-1}{01,1-1,-10} l 


In the set {10, —11,0 — 1}, we recognize the Dynkin coefficients of the 3 of 
SU (3). The set {01,1-1,-10} is the 3. Thus the 27 transforms as 


27 ~ (3,3,1) @ (1,3,3) 6 (3, 1,3) (27.19) 


An analogous argument, removing the root of, shows that the 27 trans- 
forms under the SU(6) x SU(2) subgroup as 


27 ~ (6,2) & (15,1) (27.20) 
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where the 15 is the fundamental representation D4, antisymmetric in 4 upper 
(or two lower) indices. 

Note that SO(10) and SU (5) are regular but not maximal subalgebras of 
Eg. You can probably guess how the 27 transforms under SO(10) and SU (5) 
from what we have already seen. But it is easy to check by removing the first 
Dynkin coefficient from (27.11) that the transformation law under SO(10) is 


27 1660610061 (27.21) 
where the 16 is D? and the transformation law under SU (5) is 
27 ~59 1091959591 (27.22) 


From equations (27.21) and (27.22), you can see that Eg unification can 
be related to SO(10) much as SO(10) is related to SU (5). In addition to the 
complex SO(10) representation, 16, the 27 of Eg contains real representa- 
tions which can be given a large mass by the symmetry breaking. 


27.3 Breaking Eg 


One of the features of Eg unification is that the 27, in which the matter parti- 
cles live, also contains SU (2) doublets, living in the 10 of SO(10), that could 
be responsible for the breaking of SU(2) x U(1) at low energy. One might 
ask the following group theory question. Can one break the Eg symmetry 
down to the low energy SU(3) x SU(2) x U(1) entirely with Higgs trans- 
forming like 27s. The answer is no. You can see this from (27.22). If there 
are 27s with vacuum values in the two SU (5) singlet directions, that breaks 
the symmetry down to SU (5). But there are no other SU (3) x SU (2) x U(1) 
singlet components in the 27, because there are none in the SU (5) 10, 5 or 5. 
Thus anything in the 27 that preserves SU(3) x SU (2) x U(1) also preserves 
SU (5). Something else is needed. Here we will simply state without proof 
that the additional breaking can be provided by a Higgs transforming like the 
78 dimensional adjoint representation. 


27.4 SU(3) x SU(3) x SU(3) unification 


In passing, it is worth noting that the SU(3) x SU(2) x U(1) symmetry is 
actually completely contained in the SU(3) x SU(3) x SU(3) subgroup of 
Eg. Observe further that the transformation properties of the 27 of Ec, from 
(27.19), are invariant under cyclic permutations of the three SU (3) factors. 
This cyclic symmetry is an inner automorphism of the Eg algebra, and thus a 
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discrete subgroup of Es. It is related to the symmetry of the extended Dynkin 
diagram of Es, (27.12). But it is an inner, rather than an outer automorphism, 
because it is not a symmetry of the Dynkin diagram. 

While SU (3) x SU(3) x SU(3) is not simple, if it is supplemented by 
the cyclic discrete symmetry, it shares many of the properties of a unified 
theory. The 27 is irreducible under the combination of the continuous and 
discrete symmetry. In this model, one can do the symmetry breaking entirely 
with Higgs transforming like the 27. 


27.5 Anomalies 


There is a peculiar constraint on unified theories that follows from the struc- 
ture of quantum field theory, the mathematical language in which all these 
theories are formulated. The constraint is that if the creation operators for 
all the right-handed spin 1/2 particles transform according to a representation 
generated by matrices TR, then qu must satisfy 


Tr( (72, m5) T5) =0. (27.23) 


You can show that this symmetric trace of three generators vanishes for 
all simple Lie algebras except SU (N) for N 7 3 (and SO(6) which is equiv- 
alent to SU (4)). In SU(N), suppose that T? generate the representation D 
of SU(N). Then define the invariant tensor d^" as follows: 


UIP TP} TP’) = a (27.24) 


for the defining representation D!. Then, for any representation, you can 
show that 
TP ty EP) SAD, (27.25) 


where A(D) is an integer, which is called the anomaly of the representation 
D. Thus (27.23) is the statement that the creation operators for the right- 
handed particles transform according to an anomaly free representation of 
the unifying group. 

You can easily derive the following properties of A(D) (see problem 
27.C): 


A(D) = —A(D) (27.26) 
A(D; ® D2) = A(D1) + A(D3) (27.27) 
A(D: ® D2) = dim(D,) A(D2) + dim( D2) A(Di) (27.28) 
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For SU (5), or SU(2) x U(1), the anomaly of the 10 is the same as the 
anomaly of the 5 (see problem 27.B), thus 56:10 is anomaly free. This in turn 
implies that the representation (18.12) of a family of right-handed particles 
under SU (2) xU (1) is anomaly free, because the generators of SU (2) x U(1) 
in this representation are just a subset of the generators of SU (5) in the 510. 


Problems 


27.4. . Find A(D) for the 6, and 10 of SU (3). 


27.B. You can find the anomaly of the fundamental representations of 
SU (n) by calculating the anomaly of the SU (3) subalgebra of SU (n) under 
which the n transforms like a single 3 and  — 3 singlets. Use this to show 
that the anomaly of the 10 is the same as the anomaly of the 5 in SU(5). 


27.C. Prove (27.27) and (27.28). 


Epilogue 


Lie algebras, physics and mathematics 


I hope that the reader has come to the end of this book with an enhanced ap- 
preciation for the mathematics of Lie algebras and its application to particle 
physics. This mathematics is a jewel — a crystalline treasure to appreciate 
for all time. The physics to which it is applied, however, is not such an unal- 
loyed and eternal beauty. In physics, unlike mathematics, we are constantly 
pulled in opposite directions. At one pole, there is unification, simplicity and 
elegance — the Platonic ideal of Nature that is created by and creates mathe- 
matics. At the other, there is the marvelous chaos of this particular world — 
messy, contingent, and constantly evolving with our experimental ability to 
probe its richness. Good physics must embrace these antipodes. That is what 
makes it so much fun! 
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